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' HAS RECENT RESEARCH UNDERMINED THE 


EVIDENCE FOR EXTRA SENSORY 
PERCEPTION 


CLARENCE LEUBA 
Antioch College 


WO books, one technical,’ the other popular,’ have de- 
described the astonishing results of experiments in 
telepathy and clairvoyance at Duke University. It ap- 

peared from these experiments that certain people at certain 
times could experience what was occurring in another person’s 
mind (telepathy), or some external event (clairvoyance), with- 
out the mediation of the senses, and with a degree of accuracy 
unexplainable on the basis of chance alone. This ability was 
called extra sensory perception. 

To control chance, the Duke University investigators devised 
the now famous extra sensory perception cards. Each card 
bears one of five symbols: a cross, a rectangle, a star, a circle, 
or wavy lines. A subject confronted with these five cards face 
down, and knowing that the five different symbols were repre- 
sented, had an even chance of guessing one correct out of the 
five. When, as was usually the case, the cards were used in 
packs of 25 (five crosses, five rectangles, five stars, five circles, 
and five wavy lines), the subject had an even chance of guess- 
ing 5 correctly out of the 25. 

About one in five of the persons tested at Duke University 
gave results significantly above those expected on a purely 
chance basis. A few of these selected subjects were able to 
average 10, or even more, correct calls per pack of 25 cards, 
for many thousands of trials. On a few occasions, all 25 cards 


1J. B. Rhine, Extra Sensory Perception, 1934. 
2J. B. Rhine, New Frontiers of the Mind, 1937. 
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were called correctly. The odds against a purely chance occur- 
rence of such phenomena were ‘‘astronomical.’’ For good sub- 
jects, it seemed to make little difference whether the experi- 
menter knew the order of the cards (telepathy combined with 
clairvoyance), or whether he did not (pure clairvoyance), or 
whether the experimenter just thought of the cards in some 
supposedly random manner (pure telepathy). 

These astounding results, secured in a scientific laboratory 
of a great university and publicized in two books, one of which 
was widely read,* stimulated both laymen and professional 
psychologists, as well as the Duke University investigators 
themselves, to repeat these experiments and to devise safe- 
guards against sources of error. 

To prevent the subject from securing sensory cues from the 
cards themselves it became standard practice, in psychological 
laboratories at least, not to permit the subject to shuffle the 
cards himself or to touch them in any way.* During the test 
proper, the cards were held behind a screen. Whenever the 
experimenter himself knew the order of the cards, and the test 
was therefore one involving telepathy, he was separated from 
the subject by doors and walls. It is well known to psycholo- 
gists that when a person thinks intently about a symbol, such 
as a square or a cross, the muscles involved in saying the ap- 
propriate word—square or cross—may be innervated (ideo- 
motor action) ; that the person himself may be unaware of this 
innervation; and that persons may be influenced by sound 
waves of which they are not consciously aware (subliminal 
cues). Finally, to prevent an experimenter, under the influ- 
ence of a desire for success, from misinterpreting a subject’s 
ambiguously expressed statement of the symbol or symbols 
chosen, subjects were instructed to point to a picture of the 
symbol they had in mind. 

8 New Frontiers of the Mind. 

4 With certain packs it was possible, either tactually or visually, to get 
cues from the imprint of the symbol through the back of the card; using 
these cues, one psychologist was able to name all 25 cards correctly. 
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As these safeguards against sensory cues and recording 
errors were introduced at the Duke Laboratory and elsewhere, 
the number of successful responses, though still remaining 
slightly above average chance expectations, showed a marked 
' decline. Even the best subjects reported upon in the June, 
1938, issue of the Journal of Parapsychology averaged only be- 
tween 6 and 7 correct responses per run of 25 cards. 

It occurred to us that these more modest deviations from 
chance expectations might be due to two factors, also not extra 
sensory in nature: 

1. The selection of unusually lucky subjects out of a large 
number of subjects who scored at or below the 5 hits per run 
expected on the average through chance. It is an established 
mathematical principle that the statistical formulae used by 
E.S.P. investigatcrs apply only when all the data for all the 
subjects are taken into account. Nevertheless, the subjects re- 
ported upon in the literature today are chosen from a much 
larger, but unknown, number of subjects who may have scored 
at or below the five expected on the average through chance, 
and whose data have not been reported. It is to be expected, 
on a purely chance basis, that if enough subjects are tested a 
few will score between 6 and 7 correct for many runs through 
the pack of 25 cards. 

2. A tendency on the part of successful experimenters to 
persevere at any one session, with subjects who had been lucky 
initially, until those subjects had had a good series of runs; 
and conversely, with subjects who were poor, to stop after a 
series of unsuccessful runs. It has been stated repeatedly in 
the Duke University publications that attitudes of patience 
and confidence were important for successful extra sensory 
experimentation. Just as a gambler with unlimited resources 
might refuse to stop at any session until he was ahead of the 
game, so an E.S.P. investigator, buoyed by faith, might refuse 
to give up with certain subjects until they were well ahead of 
the results expected, on the average, through chance. 
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To test empirically the influence of these two factors, it 
was necessary to devise an experiment in which chance alone 
could operate.’ For this purpose we matched two packs of 
E.S.P. cards with each other, ascribing the first 10 runs to a 
first imaginary ‘‘subject,’’ the second 10 runs to a second 
imaginary ‘‘subject,’’ and so on until we had 87 ‘‘subjects.”’ 
Those ‘‘subjects’’ who averaged above 5 hits-per run for those 
first 10 runs were given a second session; the few who did well 
during that second session, were given a third session, and so 
on. With unusually good ‘‘subjects,’’ we tended to stop after 
a good series of runs; with mediocre ‘‘subjects,’’ after a poor 
series of runs. The 87 ‘‘subjects’’ made 2,682 runs (67,050 
separate trials or matchings) and averaged close to the chance 
expectation of 5 hits per run (5.03). Seven ‘‘subjects,’’ how- 
ever, did have, at one time or another, 8 or more consecutive 
runs with scores which, according to the Handbook for Testing 
Extra Sensory Perception (by members of the staff of the 
Duke University Laboratory of Parapsychology), were clearly 
indicative of extra sensory powers—on the part of our cards! 
‘*Subject’’ B averaged 6.2 hits per run of 25 cards for his 41 
runs (1025 trials; critical ratio 3.8). 

This experiment provides an empirical verification of what 
one would expect, a priori, from the application of accepted 
mathematical principles: namely that as long as there are 
large numbers of people being tested for E.S.P. capacity, there 
are bound to be a few, who, through pure chance, will have a 
number of correct responses greater than the number expected 
on the average through chance. Likewise, if there are enough 
investigators in the field, there are bound to be a few investi- 
gators who will have an average number of correct responses 
for all their subjects slightly above the average chance expec- 
tation. 

That both the selection of lucky subjects and the tendency 
to persist with certain good subjects until they were above 


5 Leuba, C., ‘‘An Experiment to Test the Role of Chance in E.8.P. 
Research,’’ Journal of Parapsychology, 1938, 2, 217-221. 
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average chance expectations are effective, was shown by repeat- 
ing the experiment just described under two sets of circum- 
stances. If the experimenter was instructed to persevere with 
his ‘‘subjects’’ until above average scores were obtained, the 
'“*subjects’’ averaged above 5 hits per run. If the experi- 
menter was instructed to secure a large number of ‘‘subjects,’’ 
but, by giving each ‘‘subject’’ a predetermined number of 
runs (40), to prevent his own attitude from influencing the 
results, a few ‘‘subjects’’ were found who averaged distinctly 
above 5 hits per run for the total of 40 runs (1000 trials). In 
neither experiment, however, were the results as significant as 
in our first experiment when both factors were permitted to 
exercise their influence. 

To summarize: The large number of correct responses ob- 
tained in the early Duke experiments and reported in two 
recent books, fail to appear when sensory cues and recording 
errors are eliminated; the more modest successes of the later 
experiments, under better controlled conditions, can be ob- 
tained by the selection of lucky subjects and through the influ- 
ence of the experimenter’s attitude in determining the point 
at which the testing will be discontinued.*® 

* There is as yet little indication that the Duke investigators and their 
associates have been appreciably affected by the negative evidence sum- 
marized in this article. In recent months, they claim to have demonstrated 
experimentally the ability of certain subjects to foretell, before shuffling 
the cards, what would be their order after they have been shuffled, to a 
degree of accuracy slightly greater than the five correct calls expected on 
the average through chance (pre-cognition); and the ability of certain 
experimenters to get reliably better results from their subjects than other 
experimenters, presumably because of the latter’s inadequate attitudes. 
Both these recent developments could be ascribed, of course, to the non- 
E.8.P. factors mentioned in this article: the selection of unusually lucky 
subjects or experimenters, and the tendency on the part of successful 


experimenters to persevere until better than average chance results have 
been achieved. 











RESTANDARDIZATION OF THE A-S REACTION 
STUDY AS A PERSONNEL FORM 


RICHARD 8. SCHULTZ anp SYDNEY ROSLOW 
The Psychological Corporation, New York City 


E Allport Ascendance-Submission Study’ was designed 

to differentiate degrees of ‘‘initiative’’ in social contacts. 

A revision? for business use has been made in which a 

number of the original items have been eliminated and in which 

other items have been re-written or especially prepared in order 

to pertain directly to business rather than to academic situa- 
tions. 

LIMITATION OF THE BUSINESS REVISION 


The business revision has been used for some time at the Psy- 
chological Corporation in personnel research.** Its use has 
been restricted because the scores spread over a wide middle 
range, giving a flat distribution. Consequently the test does 
not discriminate between individuals so as to yield scores falling 
into a normal distribution. In an earlier investigation, the 
authors analyzed the results of the test administered to 544 
adult men with varied employment experience who were appli- 
cants for life insurance work. The distribution shown in Table 
I (Business Revision) is flat between +19 and —8 (the second 
to the eighth decile). The distribution failed to yield a 

1G. W. Allport. A Test for Ascendance-Submission. Journal of Abnor- 
mal and Social Psychology, 1928, 23, 118-196. 


2R. O. Beckman. Ascendance-Submission Test—Revised. The Person- 
nel Journal, 1933, 11, 387-392. 

8 Achilles, P. 8. and Schultz, R. 8. Characteristics of Life Insurance 
Salesmen. The Personnel Journal, 1934, 12, 260-263. 

4R. 8. Schultz. Standardized Tests and Statistical Procedures in Selec- 
tion of Life Insurance Sales Personnel. JOURNAL OF APPLIED PsYcHOL- 
oGy, 1936, 20, 553-566. 
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TABLE I 
Frequency Distributions of the A-S Reaction Test 
BUSINESS REVISION® RESTANDARDIZED SCALE** 
Scores f Scores f 
B7—BD on... ~ 1 190-199 ................ . 3 
BARBS a eeccssnne 4 180-189 .................. 5 
SS: 2 170-179 ................ i 15 
y) | | 5 160-169 .................. 15 
ps | * 16 150-159 .................. 31 
)  , 14 140-149 ................. 48 
| 24 130-139 ................. 59 
16-18 ................. 35 120-129 ................. 66 
TIMED Ginn 45 110-119 .................. 71 
10-12 35 100-109 ................. 62 
y | 50 90— 99 ................. 52 
BD eatin 48 80— 89 ................. 28 
WD: cabin tans 48 gh nae 3 24 
BeOS poe 46 60— 69 ................ 13 
- &-— 8 ...... . 48 50— 69 ................ 7 
- &-— 6 ...... . 37 Ge EP din 1 
-ll-- 9... 27 —_ 
-14—- =12 ...... . 21 N 500 
-17- —16 ...... es 18 M 119.3 
— 20—- —18 ....... 6 8.D. 29.0 

— 23— —211 ........ 8 

— 26—- -24 . 3 

— 29- -27 ....... 1 

— 32— —30 ....... 1 

— 35- — 33 ........ 1 

N 544 

M 46 
8.D. 12.5 








* Distribution of scores obtained in a previous investigation. 
** Distribution of scores obtained in the present study. 


clearly defined modal interval. The flatness in the middle 
ranges suggests that this distribution corresponds to the rectan- 
gular type of normal curve described by Kelley.5 Others® 


5T. L. Kelley. Statistical Method. Macmillan, New York, 1923. 
68. H. Stevens and E. F. Wonderlic. The A-S Reaction Test, A Study 
of Beckman’s Revision. The Personnel Journal, 1934, 15, 222-224. 
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have also reported this situation in which the distribution has 
been flat over a wide range; namely, between + 22 and +1 (the 
first to the fifth decile). 


METHOD OF RESTANDARDIZATION 


The Business Revision of the A-S Reaction Study was ad- 
ministered to a new group of 500 adult men, applicants for 
sales positions. These men have had a wide variety of employ- 
ment experience in all types of occupations and business. Their 
ages ranged from 21 to 45, and their education from sixth 
grade to college graduate. The men were chosen at random 
from six cities ; New York, Philadelphia, Chicago, Syracuse, Bos- 
ton and Columbus. The tests were scored according to the key 
supplied with an earlier revision.? On the basis of these scores, 
the upper 27 per cent and the lower 27 per cent were selected. 
An item analysis was made using the upper and lower 27 per 
cent method attributed to Kelley.’ A new scoring system was 
derived and the 500 tests were rescored. The distribution of 
the revised scores is shown in Table I. 

An inspection of the table indicates that this distribution is 
decidedly more normal than the distribution of the Business 
Revision discussed above. The histogram for this frequency 
distribution and the theoretical normal curve fitted to these data 
have been omitted here. The goodness of fit was determined 
by the chi-square function. The value of P was found to be 
.98. Holzinger* states that unless P is .2 or more the fit should 
not be regarded as good. The goodness of fit in this case is 
unusual. Since the investigation is concerned not with the 
question of sampling but with the construction of a measuring 
device to yield scores which distribute normally, the value of 
P indicates that the latter aim has been achieved. 


7 J. A. Long and P. Sandiford. The Validation of Test Items. Bulle- 
tin No. 3, 1935. Department of Educational Research, University of 
Toronto. 

8K. J. Holzinger. Statistical Methods for Students in Education. 
Ginn & Co., Boston, 1928, p. 247. 
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GENERAL SUMMARY 


This study is the first step in the restandardization of the 
Business Revision of the A-S Reaction Study. The aim is to 
develop a personality test for adults to be used in personnel 
administration. The Business Revision has been limited in 
use because it spread the scores over a large range tending to 
give a rectangular distribution. The restandardization gave a 
more normal distribution. The next step in the research pro- 
gram now in progress is to accumulate data on other adult 
groups and to construct norms.* 


* Seoring procedure, reliability, and validity will be published subse- 
quently. 














THE MICHIGAN VOCATIONAL 
PREFERENCE LIST 


EDWARD B. GREENE anv VIRGINIA DAHLEM 
University of Michigan 


THE PROBLEM 


le types of Vocational Preference Lists are to be found. 
One type, called Form I in this report, has the items 
arranged in alphabetical order. It is illustrated by” 
Strong’s (1933) Vocational Interest Blank, Thurstone’s (1935) 
Vocational Interest Schedule, Manson’s (1931) Occupational 
Interests for Women, and many others. The other type, called 
Form II, has the items arranged by divisions which are designed 
to be homogeneous. This is illustrated by the Michigan Voca- 
tional Preference List, Table 4. Form II has divisional head- 
ings which may influence the responses to the subheadings by 
establishing a mental or emotional ‘‘set.’’ Form I has no 
divisional headings, hence the ‘‘set’’ for each response must 
vary, depending somewhat on the previous item. This requires 
a shift from the consideration of one occupation to another 
which is often far removed from it. Thus in Thurstone’s Voca- 
tional Interest Schedule the first four items are ‘‘actor, adver- 
tiser, architect, and army officer.’’ 

A serious obstacle to the widespread use of Form I is the 
expense of scoring. The scoring must be done with a different 
stencil for each occupation or factor. Strong (1933) prepared 
twenty-two stencils and Thurstone (1935), seven. From these 
a profile showing one’s relative likes and dislikes is prepared. 
To eliminate that difficulty Form II was designed so that a score 
could be obtained for each division by finding the proportion 
of items marked L (Like). This form may be quickly scored 
without the use of stencils. It is expected that when Forms 
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I and II have the same items they will give profiles which are 
substantially the same. Reasons for this expectation are: 


1. The analyses of Strong’s (1934) results showed that there 
was considerable similarity between a number of the scoring 
keys. Dwyer (1937) has taken these keys and found that the 
present 22 Strong patterns of occupational preferences and 3 
patterns of maturity, studiousness, and masculinity can be cal- 
culated with remarkable accuracy from only five stencils: those 
for, Physicist, Teacher, Life-insurance Salesman, Journalist, 
and Office Worker. 

2. Thurstone’s (1935) analysis showed that certain items were 
more highly saturated than others with particular factors. 
Hence a selection of items could be made which would represent 
each factor fairly adequately. 

3. Studies of the permanence cf interest by Franklin (1926), 
Fryer (1924), and Strong (1931) showed that while one’s 
choices of a particular occupation varied considerably, one’s 
main division of interest tended to remain fairly constant. 


In order to ascertain the effects of grouping, two forms were 
devised having the same items and directions and no duplicate 
items. In Form I the items were arranged alphabetically. In 
Form II the items were grouped under eleven headings. 


THE CHOICE OF ITEMS 


A list of items was desired which would include the names 
of the most important occupations and professions for both men 
and women, and have as little ambiguity as possible. A group 
of seven judges, graduate students in psychology, cooperated 
in the following procedure : 


1. The 283 most important occupations and professions in the 
1930 census were listed on the basis of frequency and popu- 
larity of choice as shown by previous check lists. 


2. Occupational names which seemed ambiguous were dis- 
earded. Thus, in place of ‘‘florist,’’ which refers either to one 
who sells flowers or one who grows flowers, ‘‘storekeeper’’ and 
‘‘nurseryman’’ were chosen. In place of ‘‘banker,’’ ‘“‘bank 
clerk’’ and ‘‘bank official’’ were chosen. 


3. Items which had nearly the same meaning were studied 
and in some cases one was eliminated. 
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From this work 181 items were finally selected. 


COMPARISONS WITH OTHER LISTS OF OCCUPATIONS 


In order to show the amounts of overlapping between various 
vocational lists, Table 1 was prepared. This shows that all 
the lists compared have approximately 70 per cent of items 
identical with, and 14 per cent of items similar to the list 
prepared in this study. Another 11 per cent of items from 
these lists were more general than the Michigan List including 


TABLE 1 
A Comparison of Items in Various Lists with the Present Study 





THUR- MAN- 
MICHIGAN — STONE CLEETON om 
FORM 





Men Women Men Men Women Women 


N%|N*|NZI|NS| N%S|NG 
69 69 | 87 68 70 | 56 77); 91 74} 111 70 
14 19| 24 19 13 8 11; 12 10| 26 16 
14 14; 12 9 8 ll 8 11| 15 12 8 5 
3 3 5 4 aR Re 5 4) 15 9 
100 100 | 128100 | 74 100 | 73100 | 123100 | 160 100 


























reporter, manufacturer, civil service employee, college professor, 
factory worker, author of technical book, public speaker, farmer, 
scientific research worker, magazine writer, etc. Approxi- 
mately 5 per cent of items were found to be more rare or not to 
constitute a well defined occupation. These included auc- 
tioneer, dean of women, minister’s assistant, taxidermist, gov- 
ernor of a state, labor arbitrator, notary public, etc. 

It is evident from this comparison that the Michigan List 
includes the important occupations for men and women which 
are listed in the other check lists studied. The Michigan list 
differs from the others in having a larger number of trade 
names and a smaller proportion of general items. These gen- 
eral items are, however, well represented in the present study 
in the form of more specific occupational names. It was hoped 
that this greater specificity would lead to more precision in 
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ratings and hence a better analysis of interest, particularly 
among persons in semi-skilled trades, where Paterson and 
Darley (1936) found that the Strong scale was not discrimi- 
native. 


THE CHOICE OF DIVISIONS 


An inspection of the items selected showed very wide differ- 
ences in prerequisite training. To help evaluate this factor a 
study was made of technical and professional school catalogues 
and of Freyer’s 1922 report. The average years of training 
needed was then placed after each item. There was considerable 
disagreement among the judges concerning some of the items, 
due to the fact that in some occupations the standards of train- 
ing are variable. The numbers selected represent the approxi- 
mate years of training needed for average success by the average 
person in the occupation in the Middle West. 

From a study of these figures, two large groups appeared: 
A. of occupations which needed more than 12th grade training 
or ability, and B. of occupations which generally needed less 
than 13 years of average training. These two groups served 
as a rough basis of further classification because the 12th grade 
is a fairly definite educational level, and there is a large gap 
between 12th grade work and further study. 

A further classification of occupations according to common 
occupational objectives and techniques was made by each judge 
using a variation of a paired comparison method. This resulted 
in six divisions in Group A. and six corresponding divisions in 
Group B. as shown in Table 2. 

There was little disagreement among the judges concerning 
these main divisions. It was felt by some that the sixth division, 
Government and Legal, could be logically divided between the 
Social and Commercial divisions. However, it was found to 
represent a large field with a special vocabulary and techniques, 
so it was included. Some of the judges also felt that a division 
of occupations which required special language training was 
needed. After considerable debate this motion was lost, be- 
cause it appeared that the language training was, with few 
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TABLE 2 
Subjective Classification of Occupations 








A. ABOVE 12TH GRADE B. BELOW 13TH GRADE 
1. Social Professions 7. Recreation 
Ministry Entertaining 
Personnel : 
8. O Tech 
Social Work yo seus miques 
2. Commercial Accounting 
Business Telegraphy 
Sal 7 ¥ 9. Mechanical Trades 
e Iron Work 
— Wood Work 
3. Physical Sciences a Work 
Transportation 
Physical El on 
Engineering —_ 
4. Biological Sciences 10. Agricultural Trades 
Food Growing 
Zoology 
Nutrition ; Forester 
Medicine Horticulture 
5. Fine Arts 11. Personal Service 
Plastic, Graphic Dressmaking, Tailor 
Musie Cleaning, Laundry 
Drama Hair Dressing 
Literature Food Preparation and 
6. Government and Legal Service 
Legislative 12. Police 
Military Sailor 





exceptions, subservient to writing about a field already in- 
cluded, or to the teaching of language. Such occupations as 
foreign correspondent, publicity agent, art critic, etc., were 
therefore placed in government, commerce, and art divisions, 
respectively. A similar motion for a mathematical division was 
lost for similar reasons. It was agreed that the classification 
might be changed on the basis of future findings. It was ad- 
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mitted that the classification emphasized practical vocational 
training, rather than functional or basic capacities or satisfac- 
tions. 

The choice of divisions was further aided by inspecting the 
analyses of Strong (1934), Thurstone (1935), and Cleeton 
(1935) shown in Table 3. The divisions used in this study 
correspond very closely with factor analyses of Thurstone and 
Strong in occupations above the 12th grade and fairly well with 
Cleeton’s classification throughout. The differences in classi- 
fication are due to variations in approach to the selection of 
items. Only from more careful definitions and factor analyses 


of large groups of items can such classification be made more 
effectively. 








TABLE 3 
Classification of Occupations Compared 
PRESENT THURSTONE STRONG CLEEBTON 
STUDY 1935 1934 1 
1. Social and People Teaching 
Ed. Service Descriptive (Maturity) Ministry 
2. Commercial Commercial Business Sales 
Accounting 
3. Physical Physical Science Physical 
Science Science (Intellect) Science 
4. Biological Biological Biological 
Science Science Science Science 
Nursing 
5. Art (Art) ————_ Artistic 
6. Government 
Legal Legal 
7. Recreation Athletics —_—_—_— —_—_— 
8. Office Tech- Office and 
niques —_—_—_——— ———— Clerical 
9. Mechanical Mechanical 
Trades Trades 
10. Agricultural 
Trades ——— __——- —————_ 
11. Personal Personal 
Service __ ——- Service 
12. —————— Academic Language Literary 


(Femininity) Legal 
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The actual placing of an item in a division resulted in very 
few disagreements. These appeared, in most cases, in making 
decisions between the A and B groups. Thus, a few items in 
Divisions 7 to 11 inclusive require more than 12th grade abil- 
ity. These items were placed as they appear in Table 4, because 
they seemed to have more in common with the techniques of the 
B group. The 6th and 12th divisions were combined because 
so few items were obtained for each. A detailed factor analysis 
is planned for these items to see to what extent each item may 
be dependent upon common factors, among various age, sex, 
and occupational groups. 


TABLE 4 
Michigan Vocational Preference List: Form II 








Name Ae cece eS A Ee 
Directions: If you would like the work in an occupation listed below, circle 
the (L) in front of it. If you would dislike it, cirele the (D). If you 
don’t know or if you are neutral, circle the (?). Please check all items. 
Work rapidly. Salary and working conditions are to be checked on 
another sheet, so do not allow them to influence your choices here. 


1. Social and Educational Service 


1. L ¢ D _ Boy or girl scout leader (12) 
2, L ¢ D Care of small children (12) 
3. L ¢ D Parole, probation officer (14) 
4. L # D Kindergarten teacher (15) 
5. L & D Missionary; home, foreign (15) 
6. L # D Grade school teacher (16) 
7. L ¢ D Visiting teacher (16) 
8 L # D Civies teacher (16) 
9 L ¢ D Language teacher (16) 
10. L # D Clergyman or priest (16) 
11. L # D Summer camp director (16) 
12. L  D Personnel director (16) 
13. L  D Columnist, human interest (16) 
14. L ¢ D Social service worker (17) 
15. L ¢ D Librarian (17) 
16. L ¢ D Psychologist, clinical (18) 
17. L ¢ D Sociologist (19) 
L 


Other 
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TABLE 4.—( Continued) 





ee ot ot ot 
BESSEASaAPRSRES 


o-e- a - 
oll allt od od Walt a 


SOPEPrnMPSPHPrr 


Sexrnerererrtr 


soll soll sal <a call call Stoll Sol a Sa Sn SO SO Sh SS 


sll spill poll Sul Spl cll Sh Shell Sa SA Sh SO SS 


<< <0 OO WO SO WO SO OS OO KO WO Oe OO SO SO OS SO OO WO 
~~ ~~~ 


ee ee ee ee ee 


\~B-B-B-B-B-B-B- 2 -e--e-) 


2. Commercial 
Clerk, drug store (9) 
Clerk, retail sales (9) 
Demonstrator (10) 
Manager, movie house 
Book agent 
Commercial traveler (13) 
Merchant, retail (13) 
Store keeper, head (14) 
Real estate agent (14) 
Personal Finance agent (14) 
Radio announcer (14) 
Salesman, automobile (15) 
Merchant, wholesale (16) 
Insurance agent (16) 
Buyer, Department store (16) 
Publicity agent (16) 
Press agent (16) 
Stock broker (16) 


_ Sales manager (16) 


Hotel manager (17) 
Bank officer (18) 
Other 





3. Physical Science 
Mathematics teacher (16) 
Surveyor (16) 

Building Contractor (16) 
Factory manager (16) 
Aviator (16) 

Chemist (16) 

Architect (16) 
Mechanical engineer (16) 
Electrical engineer (16) 
Physics teacher (17) 
Explorer, scientific (17) 
Inventor (18) 

Geologist (18) 
Astronomer (18) 

Other 
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TABLE 4.—( Continued) 
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4. Biological Science 
Dental hygienist (13) 
Osteopath (15) 
Physiotherapist (15) 
Nurse, graduate (16) 
Dietition (16) 
Pharmacist (16) 
Hospital Laboratory Technician (16) 
Biology teacher (17) 
Botany teacher (17) 
Dentist (18) 
Bacteriologist (18) 
Physician (20) 
Physiologist (20) 
Surgeon (20) 
Oculist (20) 
Psychiatrist (20) 
Other 





5. Arts 
Artists model (9) 
Dancer (12) 
Movie star (12) 
Illustrator (13) 
Actor or actress (14) 
Scenario writer (14) 
Modiste (15) 
Musician (16) 
Singer (16) 
Seulptor (16) 
Painter, artist (16) 
Interior decorator (16) 
Novelist (16) 
Orchestra conductor (17) 
Poet (17) 
Music teacher (17) 
Art teacher (17) 
Art critie (18) 
Playwright (18) 
Other 




















THE VOCATIONAL PREFERENCE LIST 567 


TABLE 4.—(Continued) 
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6. Government and Legal 
Soldier or sailor (8) 
Police officer or sheriff (10) 
Detective (12) 
State patrolman (12) 
Political organizer (15) 
Army officer (16) 
Government investigator (criminal) (16) 
Congressman (16) 
Interpreter (16) 
Foreign correspondent (16) 
Feature writer, political (17) 
Diplomatic service (18) 
Lawyer, criminal (19) 
Lawyer, corporation (19) 
Justice or Judge (19) 
Other 





7. Sports 
Baseball player (8) 
Professional athlete (8) 
Golf professional (10) 
Swimming instructor (10) 
Billiard expert (10) 
Bowling expert (10) 
Playground supervisor (14) 
Physical Education teacher (16) 
Gymnasium teacher (16) 
Athletic coach (16) 
Other 





&. Office 
Telephone switchboard operator (9) 
Filing clerk (9) 
Cashier, store (9) 
Typist (10) 
Mail clerk (10) 
Bookkeeper (11) 
Ticket agent, railway (12) 
Postmaster or mistress (12) 
Stenographer (12) 
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TABLE 4.—(Continued) 
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12. 
13. 
14, 
15. 
16. 
17. 
18. 
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Telegraph operator (12) 

Bank clerk (12) 

Secretary, private (12) 

City or county clerk (12) 
Linotype operator (12) 
Statistical worker (13) 

Office manager (16) 

Certified Public Accountant (19) 
Other 





9. Mechanical Trades 
Janitor or watchman (7) 
Chauffeur or taxi driver (7) 
Machine tender (7) 
House painter (9) 
Carpenter (10) 
Mason (10) 
Plumber (10) 
Welder (10) 
Stationary engineer (10) 
Machinist (12) 
Sheet metal worker (12) 
Electrician (12) 
Garage mechanic (12) 
Bus driver (12) 
Fire Department member (12) 
Telephone lineman (12) 
Radio operator (13) 
Locomotive fireman (13) 
Printer, machine (13) 
Instrument maker (14) 
Draftsman (15) 
Foreman, shop (15) 
Other 





10. Agriculture 
Rancher, cattle (8) 
Farmer, general (10) 
Poultry raiser (10) 
Forester (10) 
Truck farmer (10) 
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TABLE 4.—(Concluded) 





Tobacco farmer (10) 

Cotton grower (10) 

Dairyman (12) 

Fruit grower (12) 

Grain grower (12) 

Nurseryman (14) 

Agricultural experimenter (16) 
Government Agricultural agent (16) 
Other 


rere eh oe 
~—a~~—— e383 8 2 8 
vouvuvuuUuyS 





11. Personal Service 
Porter (4) 
Domestic helper (7) 
Waiter or waitress (7) 
Laundry worker (7) 
Manicurist (7) 
Cook (8) 
Barber (8) 
Corsetiere (9) 
Housekeeper (9) 
Tailor (9) 
Hairdresser (10) 
Cleaner and dyer (11) 
Milliner (11) 
Dressmaker (12) 
Domestic science teacher (16) 
Caterer or chef (16) 
Hotel hostess (16) 
Other 


14. 
15. 
16. 
17. 
18, 


~e <0 0 40 0 <0 oe 0 OO 8 OO OO OO 
~~ B-B~-B-e~-B----e--e- 


L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 








Considering all the items of this list please indicate your first three 
choices of occupations by placing a (1) in front of your first choice, a 
(2) in front of the second, and a (3) in front of the third. 


CHANGES DUE TO GROUPING OF ITEMS 


The results of grouping items as in Form IJ, will be discussed 
under three headings: 

1. Do Forms I and II cause different distributions of 
answers ? 

2. Which Form is more reliable? 
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3. Is there a halo effect due to grouping homogeneous items 
together ? 

In order to answer these questions the cooperation was secured 
of 211 college sophomores and juniors, most of whom were en- 
rolled in two psychology courses in the Literary College of the 
University of Michigan. Their ages and chosen occupations 
are shown in Table 5. 


























TABLE 5 
Data on Subjects 
MEN WOMEN 
CHOSEN OCCUPATION 

N 142 69 

MEAN AGE 18-7 TS8—+ 

8.D. 1-2 1-2 
1. Social and Educational ........................... 17 26 
2. Commercial 74 18 
3. Physical Science 9 1 
4. Biological Science 21 9 
5. Fine Arts 7 10 
6. Government Legal 14 1 
7. Undecided 0 4 














This shows that a majority of the men were planning to go 
into business, and a third of the women into social and educa- 
tional work. These choices should be kept in mind in interpret- 
ing the results which follow. Each subject completed each 
form twice, once at intervals of one day, and again, one week 
later. The forms were rotated to avoid practice effects. 


REsULTS 


1. Distribution of Answers. In order to answer the first 
problem, ‘‘ Were there significant differences between the two 
forms in the distribution of answers?’’ a tabulation was made 
of the frequency of the ratings of Like (L) Neutral (?) and 
Dislike (D) for each form. The results shown in Table 6 in- 
dicate that there were no significant differences of this sort 
either between the two forms or between the two trials of one 
form. There were small but. consistent sex differences. The 
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men showed a slightly greater proportion of ‘‘likes’’ than the 
women, and a slightly smaller proportion of ‘‘dislikes’’ than the 
women. The men and women showed nearly the same~propor- 
tion of indifferent or neutral answers. 


TABLE 6 
Percentage of Likes, Dislikes and Neutral Answers 

















MALE FEMALE 
L ? D L ? D 
Form I (Alphabetical) 
RRR eee 17.6 15.3 67.1 15.1 17.3 77.6 
BU CHRD siccdirncecien 149 11.6 74.5 11.7 164 719 
Form II (Grouped) 
Be I scinictitindtncitgtinn 18.3 144 67.3 14.1 14.0 719 
SnG GRN idea 164 156 68.0 — 123 15.1 72.6 





2. The Reliability of Each Form. The correlations between 
two trials of the same form for each person were secured by an 
unlike sign method adapted to three variables, which has been 
used throughout. Form I proved to be less consistent (average 
r=.76) than Form II (r=.89). The difference of .13 is seven 
times its own P.E. It must be concluded that the effect of 
Form II was to reduce considerably the random changes be- 
tween trials. In other words, when the 181 items were grouped 
under these eleven headings the students were more consistent 
in their ratings than when the items were in alphabetical order. 

3. The Correlations between Forms. The correlations be- 
tween Forms I and II were .84 and .81 on the first and second 
trials respectively. The fact that the correlations between 
Forms I and II were slightly higher than the self-correlation 
of Form I needs an explanation. This is probably because 
trials of Forms I and II were given on adjacent days, while an 
interval of one week intervened between two trials of the same 
Form. 

4. Halo Effects. If halo effects were present when the items 
were grouped they could be detected by comparing random 
changes with the experimental changes. The average random 
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changes should theoretically be zero. A halo effect would be 
indicated by a tendency to shift in one direction more than the 
other. In this investigation random changes were measured 
by tabulating the amount and direction of the changes in rat- 
ings which occurred between two trials of Form I, and two 
trials of Form II. These random changes were averaged. 

The average experimental changes were indicated by the 
amount and direction of changes in ratings between Form I 
(alphabetical) and Form II (grouped). The calculation of 
directional amount of shifting was made for each student as 
follows :* 


1. The letters circled in each column were summed for each 
trial of each form. 

2. To secure an index of random shift, the sums of the second 
trials were subtracted from the sums of the first for each form. 
These differences for Form I and Form II were added. (As 
a check, the plus differences must always equa! the minus 
differences. ) 

3. An index which showed the direction and amount of shift 
was found by multiplying the L’s by plus one, the I’s by zero, 

















TABLE 7 
The Calculation of a Halo Effect from one Subject in One Division. 
(Commercial) 
COLUMN TOTALS 
Diff. 1-2 
Trial 1 Trial 2 (Random) 
ee D Re D yt @ 
TE paticnrincciniealinn , 2:8 7 eae +2 -3 +1 
| ES 5 4 10 3 2 14 +24+2-4 
Diff. I-II (Experi- 
CONE mcenecaniitn +2-1-1 +2+4- 6 
Sums of Diffs. ....... +4+3-7 , +4-1-38 
FI ihe crisiniesin 11 L Experimental +7 L Random 











Note: This case shows four more changes toward the L rating in the 
experimental shifting than in the random shifting. 
1 We are indebted to Paul 8S. Dwyer of the Bureau of Education Investi- 


gations for aid in planning these calculations and for checking the corre- 
lational methods used. 
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and the D’s by minus one, and adding. Plus values then indi- 


cate a shift toward the Like (L), and minus toward the Dislike 
(D). 


4. To secure an index of experimental shift, the sums of Form 
II were subtracted from the sums of Form I. The procedure 
thereafter was identical with that described in 3 above. An 
example of the calculation of indices of shift is shown in Table 7. 

These calculations were completed for each person on each of 
the first three divisions: Social and Educational Service, Com- 
merce, and Physical Sciences. The cost of completing these 
computations for all the divisions proved prohibitive. The 
results on the first two divisions are considered to be more sig- 
nificant than the others because they were the fields most fre- 
quently chosen by these students. The average random shifts 
and experimental shifts are shown in Table 8. This table shows 


TABLE 8 
Random vs. Experimental Shifting (N = 211) 

















AV. 8.D. DIFF./8.D.,, 
1. Social and Ed; Service 
Random shift. ........................ .68L 5.6 
Experimental shift. .......... 1.30L 4.2 
DifFere me cee ccccececcsseeeevsennnen .72L 1.91 
2. Commercial 
Random shift .......................... .25L 5.4 
Experimental shift .......... 1.37L 5.2 
Difference ............................ 1.12L 1.96 
3. Physical Science 
Random shift ......................... 12L 3.0 
Experimental shift. .......... .00 2.9 
Diserenes 12L 37 





that all tendencies to shift were always positive and also near 
zero. The first two divisions showed a slightly larger experi- 
mental than random shift toward the ‘‘Likes’’ but differences 
were so small as to be of little significance. We must conclude, 
then, that a halo effect was present in the first division which 
caused an average shifting of .72 of one item of the nineteen, 
from a rating of Dislike (D) to Indifference (?), or from a 
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rating of Indifference (?) to Like (L). In the commercial 
division the halo effect was a little larger, 1.12 items of the 
twenty-one were shifted toward the ‘‘Likes’’ by the average 
student. In the case of the Physical Sciences, the shift was in 
the opposite direction for .12 of an item among the fourteen 
was rated more toward the ‘‘Dislikes’’ than in the random 
shifting of ratings. The last case is a nearly-chance variation. 
Whether or not these shifts were due to cognitive or affective 
groupings cannot be determined from this data. 


SUMMARY AND CONCLUSIONS 


One hundred and eighty-one names of occupations and pro- 
fessions for men and women were selected from the 1930 United 
States Census on the basis of number employed and the type 
of work. These names were arranged alphabetically in Form 
I. In Form II, the same items were grouped in twelve divi- 
sions. The first six divisions included professions which usu- 
ally require more than 12th grade training: 1. Social and Edu- 
cational Service, 2. Commerce, 3. Physical Science, 4. Biologi- 
cal Science, 5. Fine Arts, 6. Government and Legal. The last 
six divisions included occupations which usually require less 
than 13th grade training: 7. Recreation, 8. Office Work, 9. 
Mechanical Trades, 10. Agriculture, 11. Personal Service, 12. 
Police. 

Two hundred eleven college students completed two trials 
each of Form I and II. The forms were rotated so as to avoid 
practice effects. 

The results showed no significant differences in the distribu- 
tion of ratings between Forms I and II. Form II (grouped) 
was answered more consistently on the two trials (r,, =.89) 
than Form I (.76). The correlations between the two forms 
were approximately .825. There were very slight halo effects, 
shown by a tendency to ‘‘like’’ the occupations in the Social 
and Educational Service, and in Commerce, more in the grouped 
than in the alphabetical form. A very slight tendency to 
‘*like’’ the occupations in the Physical Sciences less in the 
grouped than-the alphabetical form was noted. 





THE VOCATIONAL PREFERENCE LIST 575 


From these results it must be concluded that the grouping 
of these items under eleven headings improved the reliability 
of the form, but resulted in no significant changes in distribu- 
tion of ratings, and caused no marked halo effects. 

Other investigations are needed to show whether these results 
will obtain from other age groups or from other occupational 
groups. 
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THE VALUE OF THE BERNREUTER PERSON- 
ALITY INVENTORY FOR DIRECT AND 
DIFFERENTIAL PREDICTION OF 
ACADEMIC SUCCESS AS MEA- 
SURED BY TEACHERS’ 

MARKS* 


CLAUDE L. NEMZEK 
University of Detroit 


URING recent years numerous investigations have been 
made to measure non-intellectual factors conditioning 
academic success. These investigations have closely 

followed the development and construction of new measuring 
instruments. The present study contains an analysis of the 
Bernreuter Personality Inventory and attempts to reveal any 
value it may have for direct and differential prediction of 
academic success as measured by teachers’ marks. 

The Bernreuter Personality Inventory has been adminis- 
tered to sophomore students at the University High School of 
the University of Minnesota in April of each year for a num- 
ber of years. Adequate data were available for 92 boys and 
99 girls on the following eight variables: 

(1) intelligence quotients, 

(2) B1-N scores, 

(3) B2-S scores, 

(4) B4-D scores, 

(5) honor point averages in mathematics, 

(6) honor point averages in English, 

(7) honor point averages in history and social science, 

(8) honor point averages in languages. 

* From a thesis submitted to the Graduate Faculty of the University 
of Minnesota in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy. 
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The measure of intelligence used in this study was based 
upon the results of five group intelligence tests. The tests 
employed were Army Alpha 8, Pressey Senior Classification, 
Haggerty Delta 2, Terman Group Test, Form A, and Miller’s 
Mental Ability Test, Form A. Intelligence quotients were 
computed from the results of each test for each individual. 
The author’s manuals for the respective tests were followed as 
closely as possible in administering and scoring, and, for all 
cases except that of the Pressey Test, in computing the intelli- 
gence quotients. In this instance, where the author’s norms 
proved inadequate for children who made unusually high 
scores, the difficulty was resolved by extrapolation. The in- 
telligence quotients were in all instances converted into Stan- 
ford-Binet equivalents by means of the method proposed by 
Miller. Of the five intelligence quotients, the middle value 
was chosen as the measure to be used for each individual. 

Marks at University High School are given in the form of 
letter ratings. For the present study the letter ratings were 
converted into honor point averages. Each quarter hour mark 
of A was given three honor points; each quarter hour mark of 
B, two honor points; each quarter hour mark of C, one honor 
point ; each quarter hour mark of D, no honor points ; and each 
quarter hour mark of F, minus one honor point. Then the 
total number of honor points in each of the four subject-mat- 
ter fields was divided by the total number of quarter hours oi: 
marks involved in the respective subject-matter fields in order 
to obtain the four honor point averages. 

Only those cases were retained for study which had two or 
more years of courses in each of the four subject-matter fields ; 
namely, mathematics, English, history or social science, and 
languages. All of the students had completed at least the 
junior year of high school; most of them had been graduated 
from the University High School when the present analyses 
were made. 

1W. 8S. Miller, ‘‘The Variation and Significance of Intelligence 


Quotients Obtained from Group Tests,’’ Journal of Educational Psychol- 
ogy, 15: 359-366, 1924. 
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The zero order Pearson product-moment coefficients of corre- 
lation were calculated between the several pairs of the eight 
variables for the boys and girls separately. Tables I and II 


TABLE I 
Pearson Product-Moment Coefficients of Correlation with Their Prob- 
able Errors Based upon (1) Intelligence Quotients, (2) B1-N Scores, 
(3) B2-S8 Scores, (4) B4—D Scores, (5) Honor Point Averages in Mathe- 
matics, (6) Honor Point Averages in English, (7) Honor Point Averages 
in History and Social Science, and (8) Honor Point Averages in Lan- 
guages of 92 Boys. 








pal 2 3 4 5 6 7 8 

B- semaape | 102 034 | —.088 438 460 538 458 
070 070 070 .057 055 .050 056 

Bacau " —.420 | -.752 | -—.113 | -.064 | -.027 | —.0003 
058 031 069 .070 .070 070 
© miskices 498 | .197 | .166 | .039 | .034 
053 068 068 070 070 
ee 037 | .089 | .002 | -.057 
070 070 .070 070 
© cin -630 670 641 
042 039 041 
} errno 803 -781 
025 | .034 
F  npecices -766 
029 


























include the zero order coefficients for boys and girls, respec- 
tively. Table III presents the means and standard deviations 
for both sexes. 

_ A pertisal of Tables I and II suffices to show that the Bern- 
reuter Personality Inventory has practically no value for pur- 
poses of predicting academic success as measured by teachers’ 
marks. The data indicate that the mental functions measured 
by the Bernreuter Personality Inventory have little, if any- 
thing, in common with the mental functions measured by 
teachers’ marks. In the case of the boys, the B1I—N scores 
correlate —.113 + .069 with honor point averages in mathe- 
matics ; — .064 + .070, with honor point averages in English ; 








—— 


PERSONALITY INVENTORY 579 


TABLE II 
Pearson Product-Moment Coefficients of Correlation with Their Prob- 
able Errors Based upon (1) Intelligence Quotients, (2) B1-N Scores, 
(3) B2-S Scores, (4) B4—D Scores, (5) Honor Point Averages in Mathe- 
matics, (6) Honor Point Averages in English, (7) Honor Point Averages 
in History and Social Science, and (8) Honor Point Averages in Lan- 
guages of 99 Girls. 
































prod 2 3 4 5 6 7 - 

$n. -.114 153 095 583 .602 486 571 
.067 .066 .067 045 .043 .052 046 
_eteneeaas. — .243 — .658 — .134 — .093 — .106 — .045 
.064 .038 .067 .067 .067 .068 
Petals 432 .087 .081 131 .056 
.055 .067 .067 .067 .068 
r eee .033 .073 111 | -.064 
.068 .067 .067 .068 
PR Dee .786 721 715 
.026 .033 .033 
igre Cerda 845 -787 
.019 .026 
in eee -708 
.034 

TABLE III 


Means and Standard Deviations of (1) Intelligence Quotients, (2) 
B1-N Scores, (3) B2-S Scores, (4) B4—D Scores, (5) Honor Point Aver- 
ages in Mathematics, (6) Honor Point Averages in English, (7) Honor 
Point Averages in History and Social Science, and (8) Honor Point Aver- 
ages in Languages for 92 Boys and 99 Girls. 














Boys (N =92) GIRLS (N = 99) 
VARIABLES 
Mean 8.D. Mean 8.D. 
D  vacsnainbteiinieninihanss 119.15 11.80 117.45 12.20 
I sichcctbaspsabvhaahenioiesiaads — 92.00 71.50 — 76.25 83.00 
_ SpE vee 14.50 45.75 - 20.25 52.00 
pe RRR ALP ee 33.75 61.00 21.25 59.00 
1.130 .695 1.055 .735 
1.055 .640 1.410 .650 
, Der 1.295 .735 1.225 -795 
seach astecnnisletae 1.105 -785 1.400 855 
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- .027 + .070, with honor point averages in history and social 
science ; and — .0003 + .070, with honor point averages in lan- 
guages. Only one of these coefficients is even larger than its 
probable error, and in that instance, — .113 + .069, the rela- 
tionship is too small to be of any statistical significance. 

Further study of Table I shows that B2-S scores correlate 
.197 + .068 with honor point averages in mathematics; .166 
+ .068, with honor point averages in English ; .039 + .070, with 
honor point averages in history and social sciences; and .034 
+ .070, with honor point averages in languages. The latter two 
coefficients are of a smaller magnitude than their probable 
errors ; the former two, while not statistically significant, show 
some degree of positive relationship. When intelligence is 
held constant by means of partial correlation, the relationship 
between the B2-S scores and the honor point averages in 
mathematics becomes .203 + .067. This coefficient does not 
meet the usual criterion for statistical significance, but it is 
3.03 times its probable error. One must conclude, however, 
that it has negligible value for prediction purposes. Simi- 
larly, the correlation between B2-S scores and honor point 
averages in English becomes .169 + .068 when intelligence is 
held constant by means of the partial correlation technique. 
This coefficient is not statistically significant, being 2.49 times 
its probable error. Moreover, it has negligible value for pur- 
poses of prediction. 

The B4-D scores correlate .037 + .070 with honor point 
averages in mathematics; .089 + .070, with honor point aver- 
ages in English ; .002 + .070, with honor point averages in his- 
tory and social science; and — .057 + .070, with honor point 
averages in languages. Only one of these coefficients is even 
larger than its own probable error. 

Similar data for girls indicate that the B1—N scores corre- 
late — .134 + .067 with honor point averages in mathematics; 
— .093 + .067, with honor point averages in English; — .106 
+ .067, with honor point averages in history and soeial science ; 
and —.045 + .068, with honor point averages in languages. 
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Not one of these coefficients meets the usual criterion of sta- 
tistical significance. 

The data show that the B2-S scores correlate .087 + .067 
with honor point averages in mathematics; .081 + .067, with 
honor point averages in English ; .131 + .067, with honor point 
averages in history and social science; and .056 + .068, with 
honor point averages in languages. No statistical significance 
may be attached to any of these coefficients. 

The B4—D scores correlate .033 + .068 with honor point 
averages in mathematics; .073 + .067, with honor point aver- 
ages in English; .111 + .067, with honor point averages in 
history and social science ; and — .064 + .068, with honor point 
averages in languages. These coefficients fail to reveal any 
significant relationships. 

The present data justify the same conclusion given by 
Finch and Nemzek? in a previous study in which they investi- 
gated the value of the Bernreuter Personality Inventory for 
the direct prediction of total scholastic achievement on the 
secondary school level. They conclude, ‘‘The data at hand 
furnish no evidence that the Bernreuter inventory is measur- 
ing any traits that contribute in any important degree to suc- 
cessful achievement in the high school.’’* 

Despite the fact that certain variables may be of little value 
for purposes of direct prediction, they may be more valuable 
as prognostic of differential ability. This is primarily due to 
the fact that in finding a differential correlation coefficient a 
negative and a positive direct relationship coefficient may be 
brought together and the effect is an additive one. A survey 
of Tables I and II reveals that of the 24 coefficients of correla- 
tion between the Bernreuter Personality Inventory scores and 
the honor point averages, fourteen are positive and ten are 
negative ; however, there are only six opportunities for negative 


2F. H. Finch and C. L. Nemzek, ‘‘The Relationship of the Bernreuter 
Personality Inventory to Scholastic Achievement and Intelligence,’’ 
School and Society, 36: 594-596, 1932. 

3 Ibid., 596. 
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and positive direct relationship coefficients to have an additive 
effect to produce higher differential coefficients of correlation. 

Despite the fact that the direct relationship coefficients are 
so low that they almost preclude any significant differential 
relationship coefficients, the value of the Bernreuter scores for 
purposes of differential prediction was determined by Segel’s* 
method. In Tables IV and V are included the differential 
prediction coefficients based upon the data available for the 
92 boys and the 99 girls under consideration. In Table VI 
are presented the means and standard deviations in step inter- 
val units as they were substituted in Segel’s formula for 
obtaining differential prediction coefficients.® 

A study of the data based upon the boys indicates that the 
Bernreuter Personality Inventory has no value for predicting 
the differences between the honor point averages for mathe- 
matics and English (5-6). The differences correlate — .066 
+ .070, .054 + .070, and — .054 + .070 with B1-N scores, B2-S 
scores, and B4—D scores, respectively. Not any of these coeffi- 

* David Segel, Differential Diagnosis of Ability in School Children. 
Baltimore: Warwick and York, Inc., 1934. Pp. 86. David Segel, Pre- 
diction of Success in College. Office of Education Bulletin 1934, No. 15. 
Washington, D. C.: Government Printing Office, 1934. Pp. 98. 

David Segel, ‘‘The Construction and Interpretation of Differential 
Ability Patterns,’’ Journal of Experimental Education, 2: 283-287, 1934. 

David Segel, ‘‘ Differential Prediction of Ability as Represented by 
College Subject Groups,’’ Journal of Educational Research, 25: 14-26, 
January, 1932; 25: 93-98, February, 1932. 

David Segel, ‘‘ Differential Prediction of Scholastic Success,’’ School 
and Society, 39: 91-96, January 20, 1934. 

David Segel and J. R. Gerberich, ‘‘ Differential College Achievement 
Predicted by the American Council Psychological Examination,’’ Journal 
of Applied Psychology, 17: 637-645, December, 1933. 

J. Murray Lee and David Segel, ‘‘ The Utilization of Data from Simple 
or Direct Prediction in the Development of Regression Equations for 
Differential Prediction,’’ Journal of Educational Psychology, 24: 550- 
554, October, 1933. 

5 The formula is: 

Tex On ~ Tox Or 
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TABLE IV 


Differential Prediction Coefficients of Correlation Based upon (1) Intel- 
ligence Quotients, (2) B1-N Scores, (3) B2-S Scores, (4) B4-—D Scores, 
(5) Honor Point Averages in Mathematics, (6) Honor Point Averages in 
English, (7) Honor Point Averages in History and Social Science, and 


(8) Honor Point Averages in Languages for 92 Boys* 




















VARIABLES 1 2 3 4 
OD asin siete 017 — .066 .054 — .054 
SERRE ¢ — .156 -.101 .186 — .042 
ME  cisistitnies te — .087 —.124 174 111 

— 329 — .069 .253 181 
—.109 — .073 144 .184 
.069 — .037 005 .088 








* The probable errors of these differential prediction coefficients range 
from .063 to .070. These probable errors were calculated by the usual 
formula. It has been suggested by Dr. George K. Bennett, of the Psy- 
chological Corporation, that the probable errors thus derived are too small. 
He recommends the use of the probable error of the difference between r’s 
as @ more accurate method of obtaining the probable errors. The larger 
probable errors would demonstrate more decisively the lack of differential 
predictive value. The same conclusion applies to the data in Table V. 


TABLE V 


Differential Prediction Coefficients of Correlation Based upon (1) Intel- 
ligence Quotients, (2) B1-N Scores, (3) B2-S Scores, (4) B4—D Scores, 
(5) Honor Point Averages in Mathematics, (6) Honor Point Averages in 
English, (7) Honor Point Averages in History and Social Science, and 
(8) Honor Point Averages in Languages of 99 Girls.* 








VARIABLES 1 2 3 4 
-086 — .088 .026 — .054 
.073 — .024 — .070 -.111 
— .098 — .098 .026 .130 
-010 .061 — .130 - .161 
— .184 — .041 -010 .150 
-.161 — .073 .089 .226 




















*The probable errors of these differential prediction coefficients range 


from .064 to .068. 
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TABLE VI 
Means and Standard Deviations in Step Interval Units of (1) Intelli- 
gence Quotients, (2) B1-N Scores, (3) B2-S Scores, (4) B4—D Scores, 
(5) Honor Point Averages in Mathematics, (6) Honor Point Averages in 
English, (7) Honor Point Averages in History and Social Science, and 
(8) Honor Point Averages in Languages for 92 Boys and 99 Girls. 











Boys (N =92) GIRLs (N = 99) 

VARIABLES 

Mean 8.D. Mean 8.D. 
Be ices 9.83 2.36 9.49 2.44 
Be issih 18.32 2.86 18.95 3.32 
ERS 22.58 1.83 21.19 2.08 
LS 23.35 2.44 22.85 2.36 
aT 5.26 1.39 5.11 1.47 
RESP? a 5.11 1.28 5.82 1.30 
Re e 5.59 1.47 5.45 1.59 
OD scaseconatdets 5.21 1.57 5.80 1.71 

















cients are as large as their probable errors; hence, they do not 
have statistical significance. 

The differences between the honor point averages for mathe- 
matics and history and social science (5-7) correlate —.101 
+ .070, .186 +: .068, and -.042+ .070 with B1-N scores, 
B2-S scores, and B4—-D scores, respectively. It is apparent 
that these coefficients are not statistically significant. When 
intelligence is held constant by means of partial correlation, 
the relationship between the B2-S scores and the differences 
between the honor point averages for mathematics and history 
and social science changes from .186 + .068 to .193 + .068. 
This partial coefficient is 2.84 times its probable error; how- 
ever, it has practically no value for prediction purposes. 

The differences between the honor point averages for mathe- 
matics and the languages (5-8) correlate —.124 + .069, .174 
+ .068, and .111 + .069 with B1-N scores, B2-S scores, and 
B4-D scores, respectively. These coefficients indicate that the 
Bernreuter Personality Inventory scores are of negligible 
value for predicting differences in achievement between 
mathematics and languages. 
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The differences in achievement between English and history 
and social science (6-7) correlate — .069 + .070, .253 + .066, 
and .181 + .068 with B1-—N scores, B2-S scores, and B4—-D 
scores, respectively. When intelligence is held constant by 
means of partial correlation, the relationship of .253 + .066 
between B2-S scores and differences in achievement between 
English and history and social science becomes .280 + .065. 
This coefficient is slightly more than four times its probable 
error, but it is very doubtful if it has any decided practical 
value. The differential prediction coefficient ri,¢-;), is — .329 
+ .063; whereas, the direct prediction values, rig and riz, are 
.460 + .055 and .538 + .050, respectively. These data cer- 
tainly demonstrate that the intelligence quotients are more 
effective for direct prediction than for differential; further- 
more, the Bernreuter Personality Inventory scores have negli- 
gible value for direct or differential prediction. 

The differences in achievement between English and the 
languages (6-8) correlate — .073 + .070, .144 + .069, and .184 
+ .068 with B1-N scores, B2-S scores, and B4—D scores, re- 
spectively. No statistical significance may be attached to these 
coefficients. The same conclusion applies to the relationships 
obtained when the differences in achievement between history 
and social science and languages (7-8) are correlated with 
B1-N scores, B2-S scores, and B4—D scores, the coefficients 
being — .037 + .070, .005 + .070, and .088 + .070, respectively. 

The data presented in Table IV have failed to reveal any 
value that the Bernreuter Personality Inventory may have for 
purposes of differential prediction. This finding is also cor- 
roborated by the results based upon the girls as shown in 
Table V. In conclusion one may say that the present investi- 
gation indicates that the Bernreuter Personality Inventory 
has no practical value for purposes of direct or differential 
prediction of academic achievement under the conditions ex- 
isting among students at the University High School of the 
University of Minnesota. 
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SUMMARY AND CONCLUSIONS 


The purpose of this study was to reveal the value of the 
Bernreuter Personality Inventory for purposes of direct and 
differential prediction. Data were available for 92 boys and 
99 girls, and the following eight variables were used : 

(1) intelligence quotients, 

(2) B1-N scores, 

(3) B2-S scores, 

(4) B4D scores, 

(5) honor point averages in mathematics, 

(6) honor point averages in English, 

(7) honor point averages in history and social science, and 

(8) honor point averages in 
These data demonstrate that the Ridin Personality Inven- 
tory has negligible value for direct or differential prediction. 











USE OF THE INTERNATIONAL SCORING 
MACHINE FOR THE RAPID COMPU- 
TATION OF TABLES OF INTER- 
CORRELATIONS 


G. FREDERIC KUDER 
The University of Chicago 


OYER and Toops' have developed a rapid method for 
R obtaining correlations from Hollerith cards on which 
the geometric codes of scores for each variable have 
been punched according to a method described by Dunn.? The 
Royer-Toops method calls for punching a card for each indi- 
vidual, and requires the use of a Hollerith key-punch and a 
counter-sorter. By the simple expedient of preparing an an- 
swer sheet and stencil for each code, it is possible to use the 
International Scoring Machine for obtaining the correlations. 
This substitution is not recommended if Hollerith equipment 
is easily available. The technique will, however, be found to 
be much more efficient than the usual manual methods if inter- 
correlations of at least five variables are to be obtained. Its use 
is most efficient when the number of cases is between fifty and 
two hun“red. 

The detailed outline of the method follows. 

1. Prepare a list of the subjects and their raw scores on the 
variables involved. Allow two blank columns after the list of 
subjects, and five narrow blank columns after the raw scores for 
each variable. 

1 Royer, E. B., and Toops, H. A. ‘‘The Statistics of Geometrically 


Coded Scores,’’ Journal of the American Statistical Association, Vol. 28, 
1933, pp. 192-198. 

2 Dunn, Halbert L. ‘‘ Adaptation of New Geometrical Code to Multiple 
Punching in Mechanical Tabulation,’’ Journal of the American Statistical 
Association, Vol. 27, 1932, pp. 279-286. 
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In Table 1, the subjects have been listed in column (1), 
and the raw scores of variables X, Y, and Z have been listed 
in columns (4), (10), and (16), respectively. 

2. Divide the range of the raw scores of each variable into not 
more than sixteen class intervals. Assign grouped scores of 
from 0 to 15 to these class intervals. Prepare tables or formulas 
for transmuting raw scores into grouped scores. 


The formula developed for transmuting Variable X into 





grouped scores is X = *- 2 in which X = grouped score, and 


X’=rawscore. Fractions are dropped in using this formula. 
The same result is produced by developing a table. In this 
case the table is: 


Raw scores Grouped score 

51-53 15 

48-50 14 

45-47 13 

42-44 12 

39-41 11 

| 36-38 10 
33-35 9 
| 30-32 8 
27-29 7 

24-26 6 

21-23 5 

18-20 4 

15-17 3 

12-14 2 

9-11 1 

6- 8 0 

The formulas used for grouped scores Y and Z are as follows: 








Y=771 tebe shot 


Fractions are always dropped in using these formulas. Dif- 

ferent formulas or tables must, of course, be developed for 

each variable. 

3. Enter in the second column for each variable the grouped 
score corresponding to the raw score. 


These have been entered in columns (5), (11), and (17) 
in Table 1. 
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Coding of Scores 
(Adapted from Royer and Toops) 
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4. Label the next four columns for each variable 1, 2, 4, and 
8, respectively. Then for each grouped score, place checks in 





the following columns so that the sum of the values of the checks 


exactly equals the grouped score. 


should be placed in a column. 
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Columns (6) to (9) have been labeled 1, 2, 4, and 8, re- 
spectively, for Variable X. Columns (12) to (15) and (18) 
to (21) have been labeled in the same way for Variables Y 
and Z. 

The X-score for individual A is 11; it is therefore repre- 
sented by checks in score code columns 1, 2, and 8, since the 
sum of these numbers is 11. A’s Y-score is 8; this score is 
therefore represented by a check in code column 8 for the 
Y-seores. The score of 12 in Variable Z is represented by 
checks in code columns 4 and 8. All grouped scores in the 
table have been coded in this fashion. 


5. For this and following steps, International Scoring Ma- 
chine answer sheets are used (although the actual entries made, 
as outlined below, are codes rather than answers). 


See Figure 1 for an illustration of part of an answer sheet. 
This represents the left portion of the standard form for 150 
questions with five alternatives each. The large figures are 
question numbers; the small figures at the five spaces after 
each question number indicate the answer positions. 


Assign each individual an answer position on the answer sheet. 
Write the question number in column (2) and the answer posi- 
tion in column (3). It is recommended that positions be as- 
signed in sequence by columns on the answer sheet. 


In Table 1, the first subject, A, has been assigned position 
1-1 (answer position 1 after question number 1 on the 
answer sheet). Subject B has been assigned position 2-1, 
and Subject T has been assigned position 20-1. If there were 
more subjects, the 31st subject would be given position 1-2 
at the head of the second column of answer positions, and the 
60th subject would be given position 30-2, at the foot of the 
second column. The 61st subject would be given position 1-3 
at the head of the third column, and so on. 


6. Label a separate answer sheet for each of the code columns 
of each variable. 


In the case of Table 1, 12 answer sheets are prepared, 
labeled, respectively, X-1, X-2, X-4, X-8, Y-1, Y-2, Y-4, 
Y-8, Z-1, Z-2, Z-4, and Z-8. 
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Figure 1. Entries on answer sheet for code X-1. 
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7. Fill in the answer sheet for each code column by black- 
ening the positions of those people who are checked in the 
column. Keep these answer sheets in order. 


For example, the ‘‘answer sheet’’ for code X—1 as illus- 
trated in Figure 1 has the following positions blackened : 
1-1, 6-1, 7-1, 8-1, 9-1, 13-1, 15-1, 18-1, 19-1, and 20-1. 
8. Make up code stencils for every answer sheet, each stencil 

having a hole corresponding to every mark on the correspond- 
ing answer sheet. 


This means making twelve stencils for the example used. 

9. Prepare a table similar to Table 2A, listing the variables 
in order and allowing five spaces in each direction for each vari- 
able. Also allow two extra columns at the right. In labeling 
the rows at the left, leave the first of the five spaces for each 
variable unlabeled, and write the code numbers of the answer 
sheets for each variable in the remaining four spaces in order 
of size from top to bottom. In labeling across the tops of the 
columns, write these same numbers, from left to right, in the 
first four spaces, leaving the fifth space blank. 


This has been done in Table 2A. 


10. Put the first stencil (X—1) in the machine and score all 
the answer sheets in order (for number ‘‘right’’). The scores 
are entered, in order, in the column for code X—1 of the table. 


See column (2) in Table 2A. 


Then score all the sheets with the second stencil (X~—2), entering 
the scores in column X-2 of Table 2A. Continue in this man- 
ner until all sheets have been scored by all stencils, and the 
table is complete. Note that each combination is entered twice 
in the table, except for the diagonal entries which are the num- 
ber of positions blackened on each answer sheet. The diagonal 
entries should be obtained twice, as a check. 


In the example used, the entry in the upper left cell is the 
number of positions which have been blackened on sheet X-1. 
The entry in the second cell in column X~1 is the number 
obtained when sheet X-2 is scored by stencil X-1. 
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TABLE 2A 
Table of Numbers of Entries Common to Each Pair of Code-Coluwmns 
in Table 1 
(Adapted from Royer and Toops) 
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11. Check the table for symmetry about the diagonal running 
from upper left to lower right. Column X~—1 reading from top 
to bottom should have the same entries as row X~—1, reading 
from left to right, and so on. 

12. In each block of 16 cells obtain the sum of the entries 
along each diagonal running from lower left to upper right; 
multiply each diagonal total by the amount given in the follow- 
ing diagram, and obtain the sum of these products. Enter this 
figure in the cell at the lower right corner of the block, and draw 
a ring around it. 


Multipliers 








‘ 
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products here 
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The total obtained is [XY, the sum of the products of the 
grouped scores of the variables listed at the end of the row and 
the head of the column. In the case of the diagonal blocks this 
total is {X?, the sum of the squares of the one variable involved. 


Take as an example the group of sixteen cells in the X-row 
and Y-column of Table 2A. These are added along the 
diagonals, and the sums listed as indicated : 














The operations involved are 
ous Sum on 
Multiplier diagonal 
1 x 4 = 4 
2 x 9 = 18 
4 x 15 = 60 
8 ~ 19 = 152 
16 x 16 = 256 
32 x 9 = 288 
64 x 6 s 384 
Total = 1162 


These steps can be performed in sequence on the calculating 
machine and the total obtained without clearing. If the mul- 
tipliers are memorized, this operation proceeds rapidly. Step 
12 has been performed for all variables in Table 2B. 

13. Copy the diagonal entries of the table in a ‘‘total’’ column 
at the right. Multiply each entry by the code value of the row 
and obtain the sum of these four products for each variable. 
Enter this sum in the next column, and circle the entry. This 
operation should be performed twice. The quantity so ob- 
tained is {X, the sum of the grouped scores of the variable 
involved. 
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TABLE 2B 
Use of Table 2A for Obtaining Data for the Computation of 
Intercorrelations 
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In Table 2B, the diagonal entries (corresponding to the 
number of entries on each answer sheet) have been entered 
in column (17). The sum of the four products of these 
entries and the code values of their rows have been entered in 
column (18) for each variable. 


14. Proceed with the calculation of the intercorrelations, 
making use of the circled entries. 


An efficient systematic procedure for reducing the quan- 
tities obtained in the second table to an intercorrelation 
table has been described by Toops.* 

It should be noted that the amount of labor involved in the 
preparation of stencils and scoring can be reduced by at least 
a third if the computer is willing to use eight class intervals 
instead of sixteen. Only three code sheets and stencils are then 
necessary for each variable. 


8 Toops, Herbert A. ‘‘Computing Intereorrelations of Tests on the 
Adding Machine,’’ Journal of Applied Psychology, Vol. 6, 1922, pp. 
177 ff. 
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If a large number of runs is called for, a set of answer sheets 
should be made up for each thirty runs. It has been general 
experience that if an answer sheet is put through the machine 
much oftener than this, inaccuracies are likely to occur. The 
extra sets of answer sheets are quickly made, however, by using 
the stencils as guides in filling out the new sheets. 

















AN ANALYSIS OF EIGHT MEASURES OF PER- 
SONALITY AND ADJUSTMENT IN RELA- 
TION TO RELATIVE SCHOLASTIC 
ACHIEVEMENT* 


NEAL E. DROUGHT 
University of Chicago 


HE rapid increase in college enrollment figures during 
the post-war years with the resulting increase in student 
mortality gave rise to many attempts to provide an index 

of probable scholastic achievement. (For an excellent sum- 
mary of the more noteworthy of these studies see 1: 1-40). 
The majority of these studies have used either a test of scho- 
lastic aptitude or a measure of previous achievement (such as 
high school average or percentile rank in class) or both. The 
correlations between predicted and achieved grades have quite 
regularly fallen within the range of plus .55 and plus .70. The 
fundamental assumption of the present study is that the dis- 
crepancies between predicted and achieved grades implied by 
these correlations are to be accounted for in some measure by 
social and emotional factors. 

With a view toward obtaining a better understanding of 
these factors, two measures, each with four subdivisions, were 
administered to 750 freshman men entering the College of Let- 
ters and Science of the University of Wisconsin in September, 
1935. The two scales employed were the Adjustment Inven- 
tory by Hugh Bell* and the Wisconsin Scale of Personality 


* The study of which this represents a part was made possible by a grant 
from the research committee of the Graduate School of the University of 
Wisconsin and received aid in the form of clerical help from the Works 
Progress Administration. The writer is indebted to Professor Kimball 
Young for continued assistance and encouragement in this study. 

1 The Adjustment Inventory by Hugh M. Bell. Published by the Stan- 
ford University Press, Stanford University, California. 1934. 


597 








ENN 


a a eee 








598 NEAL E. DROUGHT 


Traits by Ross Stagner.? Those acquainted with these mea- 
sures will recall that the Bell Inventory provides measures of 
Home, Health, Social and Emotional adjustment. The four 


divisions of the Stagner scale are described by the author as 
follows : 


‘*On scale W a high score indicates a person who is nervous, 
moody, tends to worry over trifles, and who is likely to 
adopt defense mechanisms. (Nervousness and emo- 
tionality.) 

**On scale X a high score indicates an individual who has a 
highly subjective orientation, a tendency to daydream, 
and a susceptibility to criticism. (Introversion.) 

**On seale Y a high score indicates high self-esteem and self- 
confidence. 


**On seale Z a high score indicates an individual who is averse 
to social contact, prefers intellectual to social or athletic 
activities, and takes a passive réle. (Social passivity or 
seclusiveness. )’”* 

The criterion of scholastic achievement used in this study 
was the discrepancy between predicted and achieved grade 
point averages. At the University of Wisconsin the predicted 
average is obtained by a regression equation having as variables 
the student’s percentile rank in his high school class and his 
percentile rank on the American Council Test of Scholastic 
Aptitude administered during freshman orientation week. 
(See 2.) 

The achieved average is determined by assigning three grade 
points per credit for a grade of ‘‘A’’, two grade points per 
eredit for a ‘‘B’’, one per credit for a ‘‘C’’, zero for a ‘‘D’’, 
minus one-half for an ‘‘E’’, and minus one for a failure, and 
then dividing total credits into total grade points. 

Thus by using the discrepancy between predicted and actual 
grades as a criterion of achievement rather than the attain- 
ment of an arbitrary standard such as a 1.00 (‘‘C’’) average 
we have in some measure controlled the factor of scholastic 


2 The Wisconsin Scale of Personality Traits by Ross Stagner. Published 
for the author by the Bernat Publishing Co., Chicago. 1935. 
3’ From a communication from the author. 
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aptitude. In terms of this definition a student with a grade 
point prediction of 2.00 or ‘‘B’’ who achieves a 1.00 or ‘‘C”’ 
average is just as much a ‘‘failure’’ as is a student having a 
prediction of 1.00 (‘‘C’’) who attains an average of .00 (‘‘D’’). 
Similarly it is possible to pick out those students whose achieve- 
ment is abnormally high in relation to their prediction. 


PROCEDURE 


The first step was to determine the correlation between grade 
point discrepancies and scores on each of the subdivisions of 
the Bell and Stagner scales. 

The following list shows that in all cases these correlations 
were of low order. 


Correlation 
First semester grade point discrepancy 
and home adjustment (Bell A) 
health adjustment (Bell B) 
social adjustment (Bell C) 
emotional adjustment (Bell D) 


nervousness (Stagner W) 
introversion (Stagner X) 
self-esteem (Stagner Y) 
social passivity (Stagner Z) 


In no case was there a suggestion of reliable relationship. This 
is in essential agreement with previous studies employing 
similar measures. 

The second step in the analysis was to determine to what 
extent the scales differentiated between students with dis- 
crepancies of at least plus .60 grade points (14 PE) and those 
with at least minus .60 grade points. These distributions 
were obtained from an unselected sample of 250 cases from 
the original group of 750. The scores for each of the 90 stu- 
dents in the plus group and the 115 in the minus group were 
recorded for each of the eight personality measures and the 
mean for each of these distributions was determined. 
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TABLE I 


Distribution of Scores on the Four Subdivisions of the Bell Inventory for 
Students Whose Discrepancies Were Plus or Minus .60 Grade Points 
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For example, the average score on the home adjustment scale 
for students in the plus group was 7.1 and for the minus group 
5.8. The difference between these means is 1.3 with a probable 
error of .58. In a similar fashion the same data were secured 
on each of the remaining seven scales. In no case did the 
difference between the means equal or exceed four times the 
probable error of the difference. 

Similarly, as a third step, the average grade point dis- 
crepancies were determined for students with ratings of ‘‘excel- 
lent’’ and ‘‘unsatisfactory’’ on each division of the Bell Inven- 
tory and for students scoring in the lowest and highest deciles 
on the four divisions of the Stagner scale. Naturally the 
number of cases varied from scale to scale. 

In this analysis it was found that the difference between 
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TABLE II 


Distribution of Scores on the Four Subdivisions of the Stagner Scale for 
Students Whose Discrepancies Were Plus or Minus 











.60 Grade Points 
NERVOUSNESS | INTROVERSION | SELF-ESTEEM _ 

IN PER- 

CENTILE Plus | Min Plus | Minus} Plus | Minus| Plus | Minus 
91-100 ......... 19 10 16 i] 15 17 19 12 
81— 90 ......... 7 6 13 9 6 5 5 6 
71-— 80. 9 12 6 10 6 13 11 3 
61— 70 ......... 1 11 4 9 10 y 6 15 
51— 60 ....... ‘ 8 2 8 7 8 14 11 12 
41— 50 ......... i) 17 11 14 9 7 9 18 
31— 40 ......... 7 14 10 12 ll 20 8 20 
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O— 10 ......... 14 21 9 17 8 14 15 20 
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the mean grade point discrepancies was at least four times 
the probable error of this difference for students scoring high 
and for those scoring low on the X and Z scales of the Stagner 
measure indicating introversion and social passivity respec- 
tively. On scale X the difference between the means was 
.29 + .07, and on scale Z the difference between the means was 
30 + .06. 

Apparently then introversion and social passivity are in 
some measure related to scholastic achievement. An examina- 
tion of the distributions, however, indicates that we should be 
confusing the issue by simply resorting to these statistical 
devices without further analysis. 

The final step in this section of the study was, then, to analyze 
the tests item by item. This was done by determining the 
differences in the percentage of ‘‘yes’’, ‘‘?’’, and ‘‘no’’ re- 
sponses on each part of the Bell and Stagner scales given by 
students who achieved above prediction and those who fell 
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TABLE Iit 


Distribution of Grade Point Discrepancies of Students Scoring High and 
Low on the Four Subdivisions of the Bell Inventory 
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below prediction. Selecting from the Bell Inventory the ten 
items showing the most marked differentiation, we should 
describe the poor student as one who does not often feel self- 
conscious because of his personal appearance, is in doubt as to 
whether or not he is often the center of favorable attention 
at a party, has not frequently had to keep quiet or leave the 
house in order to have peace at home, does enjoy social danc- 
ing a great deal, does not get discouraged easily, does like to 
participate in festival gatherings and make ‘‘whoopee,’’ does 
hesitate to enter a room by himself when a group of people are 
sitting around the room talking together, does not sometimes 
envy the happiness others seem to enjoy, does not cross the 





SCHOLASTIC ACHIEVEMENT 603 


TABLE IV 
Distribution of Grade Point Dsicrepancies of Students Scoring High and 
Low on the Four Subdivisions of the Wisconsin Scale 
of Personality Traits 
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street to avoid meeting somebody, and does take the responsi- 
bility for introducing people at a party. 

It will be noted that six of the items involve social partici- 
pation, three relate to emotional adjustment and one purports 
to refer to home adjustment. 

With reference to the Stagner scale the ten items showing 
the greatest differentiation would suggest that the poor stu- 
dent does not like to be by himself a great deal, does not in- 
dulge much in daydreams, does not like to have quiet amuse- 
ments rather than exciting ones, does not get tired of work 
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quickly, does not like to spend his vacations at some quiet 
place rather than at a lively resort, does not think much about 
it when praised, is not often considerably upset when un- 
successful in any undertaking, does not usually think a great 
deal before deciding anything, would not like to listen to a 
lecture on classical music, and does not usually talk spon- 
taneously. Five of these reflect social passivity, four intro- 
version, and one emotionality or nervousness. 

On the basis of these ten selected items from each measure, 
the scales were scored for a group of 563 students for whom 
the necessary data were available. Each of the items was 
considered a unit value since no weighting scheme seemed 
warranted at this stage of development. A high score was 
associated with poor achievement. 

The scores for the two sets of ten items were combined to 
give a single index and the correlation between these selected- 
items scores and grade point discrepancies was + .20. 


CONCLUSIONS 


It becomes apparent that the tests used in this study are of 
little value for the purpose of differentiating between stu- 
dents who will obtain grades above prediction and those who 
will fall markedly below prediction. 

In regard to the Wisconsin Scale of Personality Traits this 
conclusion is simply a verification by a new technique of the 
results of previous investigators using similar scales. Stagner 
himself has made one of the most extensive investigations of 
the relationship between intelligence, achievement, and per- 
sonality factors and found all correlations to be low (3). The 
present data do not, however, corroborate his conclusion that 
‘‘Extreme personality measures seem to counterbalance ad- 
vantages in aptitude, making for equal achievement in op- 
posed groups’’ (3: 10). Were this true, we should expect 
a significant negative correlation between percentile rank on 
emotionality or self-sufficiency and grade point discrepancy. 
Such was not the case. 
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Insofar as extreme scores on the parts of the Wisconsin 
Scale may be taken to indicate adjustment or maladjustment 
the same criticism may be applied as is apparent in regard to 
the Adjustment Inventory. The present data show quite 
clearly that there is no relationship between adjustment as 
measured and relative performance in college. And yet there 
is abundant literature to support the thesis that social and 
emotional adjustment, for example, do play an important part 
in college achievement. It is unquestionably true that mal- 
adjustment may show itself by abnormally high achievement 
as well as by failure, but an examination of the grade point 
discrepancies for students with high and low scores on the 
Adjustment Inventory shows no bimodal distribution such as 
would be expected were the scales consistent in pointing out 
maladjustment. 

It thus becomes clear that a different approach will be nec- 
essary if the problem of obtaining an index of personal-social- 
emotional factors in college achievement is to be solved. 


SUMMARY 


In an effort to account for the personal-social-emotional 
factors operative in scholastic achievement, the Adjustment 
Inventory by Hugh Bell and the Wisconsin Scale of Person- 
ality Traits by Ross Stagner were administered to 750 fresh- 
man men entering the College of Letters and Science of the 
University of Wisconsin in September, 1935. The scores on 
the four subdivisions of each of these measures were corre- 
lated with the discrepancy between predicted and achieved 
grades. These correlations were uniformly low. 

There was no significant difference in the average scores on 
any of the eight scales between students whose achieved grades 
were markedly below prediction and those whose grades were 
markedly above prediction. 

In comparing the average grade point discrepancies of stu- 
dents with high and with low scores on each of the eight parts 
of the two measures it was found that there was a significant 
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difference in average achievement between students who 
scored high and low on introversion and on social passivity as 
measured by the Stagner scale. The students scoring high on 
introversion and social passivity had the higher average 
grades. 

The two measures were analyzed item by item and there 
were selected the ten items from each showing the greatest 
difference between students who worked above prediction and 
those who worked below prediction. When a group of 563 
tests were scored on the basis of these items, the correlation 
between these scores and grade point discrepancies was found 
to be + .20. 

These measures are, therefore, of limited value for the pur- 
pose of this study. It is apparent that a new approach will 
have to be worked out for obtaining a measure of those per- 
sonal-social-emotional factors related to academic achievement. 
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MENTAL ABILITIES AT SENESCENCE: A SUR- 
VEY OF PRESENT-DAY RESEARCH 


GEORGE LAWTON 
Columbia University 


F the various life epochs that have been studied by scien- 
C) tific investigators, old age has received the least atten- 
tion. In recent years, however, the problems of the 
aged have begun to force themselves upon us because of socio- 
economic factors. A declining birth-rate and an improvement 
in medical skill have combined to raise markedly the average 
age of the population since the turn of the century. But while 
the life-span has been lengthened, the work-span has been cur- 
tailed and though large sums are being spent in the attempt to 
support the improvident aged, their numbers are constantly 
being increased by economic conditions. 

Nor should the problem be limited to keeping the aged alive, 
even assuming that our provisions for their economic and physi- 
cal welfare were adequate. If we are to discover and provide 
what old people need in order to be most useful and happy, it is 
important that we make accurate determinations of their men- 
tal abilities and emotional lives, and discard many of the tradi- 
tional and folk-lore judgments now current. But while the 
problems of the aged are manifold: economic, medical, emo- 
tional, psychiatric, educational and occupational, there is very 
little integrated research to guide us in a consideration of these 
problems. It is the purpose of this paper to sketch the high- 
lights of psychometric contributions to the science of geriatrics 
now in its birth-throes. 

Most of the early treatments of senescence were concerned 
with physiological and philosophical techniques for postponing 
decline and death. Typical of this approach is Mechnikov’s 
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‘‘The Prolongation of Life’’ (1908) and Lorand’s ‘‘Old Age 
Defined, The Cause of Old Age and Its Postponement by 
Hygienic and Therapeutic Measures’’ (1912). Since men have 
always shown a greater interest in life’s duration than in its 
quality, such books appear with unfailing regularity. Serge 
Voronoff’s ‘‘Life, A Study of the Means of Restoring Vital 
Energy and Prolonging Life’’ and M. Gallichan’s ‘‘ Youthful 
Old Age; How to Keep Young’’ are recent variations on the 
same theme. 

One of the first psychometric studies was Beeson’s ‘‘ Intelli- 
gence at Senescence’’’ (1920). Twenty cases, of average age 
75, in a highly selected home for the aged, yielded an average 
IQ of 83 on the Stanford-Binet. This low score Beeson attrib- 
uted to decline in mental processes at senescence. No deteriora- 
tion, however, was found with advancing age, as the older half 
of the men proved more intelligent than the younger half. The 
best performance of the group was in vocabulary, a fact which 
has since played a very important part in the study of senescent 
mental efficiency. Hdwever, the size and select character of 
Beeson’s group alone reduce the value of his results. Foster 
and Taylor in ‘‘The Applicability of Mental Tests to Persons 
over 50 Years of Age’’ (1920) reported the use of the Yerkes- 
Bridges Point Scale to test a number of younger and older 
normal and psychotic persons. The study showed among the 
normal individuals a general falling off with age. But its chief 
importance lies in making a suggestion of the greatest methodo- 
logical value for future research : namely, that senescents of the 
same level of intelligence should be compared if we are going 
to measure mental efficiency, and that the determination of the 
intellectual status of an older person would be more accurate 
if two mental ages were given: one comparing him with his 
norma! contemporaries and the other comparing him with his 
own adolescent ability (or that of normal young people). 

G. Stanley Hall’s ‘‘Senescence’’ in 1922 is the most elaborate 
attempt by a professional psychologist to view decline in its 
totality, but though it is valuable as a move to take the subject 
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out of the realm of popular stereotypes, especially with respect 
to the emotional lives of the aged, it is nevertheless an armchair 
performance based on the literature of senectitude and on 
returns to a questionnaire. As for psychometric studies, Hall 
makes no mention of them, dismissing the subject with these 
words : ‘‘ There are no mental tests of general recognized valid- 
ity above the ‘teens,’ so that we have no criteria for determining 
psychological age for even the elderly’’ (p. 196). 

The Stanford University Studies in Later Maturity made 
under the direction of W. R. Miles (now of the Institute of 
Human Relations at Yale) are the most extensive investigations 
that have been undertaken thus far of the relationship of certain 
human abilities to age. A number of papers published since 
1931 report comparisons made on a total of 2500 subjects be- 
tween the ages of 7 and 94 (250 of these over 70). The intel- 
ligence measure used was the Otis Higher given as a fifteen 
minute speed test. The maximum scoring ability, whether the 
test be of general intelligence or of a specific activity, occurs 
between the ages of 18 and 49, usually in the earlier half of 
this period; 10 to 25 per cent of those in later maturity and 
old age do as well in most tests as the average in middle matu- 
rity. Motor abilities were found to reach their maximum in 
early adulthood (between 18 and 29). In the results for 400 
subjects with the speed factor eliminated, Miles showed that 
speed did not slacken before mental power. In a later paper, 
however, C. C. Miles reported that comparisons of the curtailed 
Otis, speeded and unspeeded indicated that in early and middle 
maturity speed declined faster than power, whereas in late 
maturity the decline of power was relatively more apparent. 
In general, it was felt that there was a diminution in reaction 
speed and sum of energy available for new work types rather 
than in intelligence, decline correlating with physiological 
rather than with psychological changes. It is important to 
observe that feelings of insecurity and inferiority are consid- 
ered the greatest handicap ; appropriate training and practice, 
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persistence and other stable personality factors are considered 
assets to mental longevity. 

The chief criticism of the Stanford Studies is that the sam- 
pling was a select one, the subjects having been secured through 
church groups, clubs, lodges, and philanthropic societies, and 
that they were divided according to educational levels, which 
cannot be taken as a real separation in terms of native ability. 
The Studies show definitely the gradual decline of large groups 
of the population in general ability and in specific activities, the 
variability of maturation and age-decline in separate abilities 
and even the variability of individuals on the different tests, 
but they cannot tell how much an individual has dropped below 
the efficiency to be expected from one of his intellectual level. 
Because 10 to 25 per cent of older people score as well as, or 
better than, the average of those in their prime of efficiency 
does not necessarily mean that they have managed to maintain 
this efficiency better than their contemporaries; they may, in 
fact, have had much greater capacity to start with and have 
lost far more than many of those in the same age group who 
obtained the same or worse scores. On the other hand, a man 
of sixty testing very low in comparison with twenty year olds 
might by nature be of inferior intellect but of preserved effi- 
ciency. 

The chief problem, then, in measuring deterioration at senes- 
cence is to compare the mental efficiency of various old people 
and still manage to control the original intellectual level so that 
the comparison is between persons of equal endowment. But 
we cannot use educational levels or occupational backgrounds 
as ways of insuring intellectually comparable groups. We 
must in some manner obtain an estimate of the senescent’s peak 
performance whenever it occurred. While the most definitive 
method would be a follow-up of mental power scores of indi- 
viduals from shortly after birth through senescence, this is not 
yet a feasible procedure, and it is necessary to devise a reliable 
alternative. 
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The earliest to offer such an alternative technique was H. 
Babcock (An Experiment in the Measurement of Mental De- 
terioration, 1930; Dementia Praecox, A Psychological Study, 
1933). Her ‘‘Test of Mental Efficiency’’ designed for use 
with psychotic patients is based on the assumption that de- 
terioration occurs first in new learning and last in earliest 
formed material. Terman’s vocabulary is used as a measure 
of intellectual level and a series of speed and learning tests is 
employed for the efficiency phase of intelligence. The dis- 
crepancy between the two types of scores yields a measure of 
the deterioration. 

But, it may be asked, what justification has Babcock to assume 
that the vocabulary remains constant throughout the life span? 
It is true that other writers have noted that language is rela- 
tively slow to deteriorate (Miles), that it has a high correlation 
with mental age (Terman), and that an extensive vocabulary 
does not seem to be dependent on education (O’Connor). All 
of this shows that vocabulary is an important index of intelli- 
gence. But the language ability of the aged may be too good 
to serve as an accurate control. What if it improves with 
advancing age? O’Connor pointed out that although vocabu- 
lary does not alter materially after maturity, there is an 
increase of two school years in the period 23-50. If this is so 
and we attempt to measure the discrepancy between vocabulary 
score and the average score of a series of other tests in order 
to determine failing efficiency, a false deterioration will be 
shown unless we apply a correction. But how much this should 
be and how it is to be arrived at are very controversial points. 
Proponents of the constancy thesis call attention to the difficulty 
of believing that the aged begin to fail in memory, in learning 
and in motor skills and improve only in a single ability : vocabu- 
lary. 

Jeanne Gilbert tackles this crucial aspect of the problem of 
deterioration in ‘‘Mental Efficiency at Senescence’’ (1935) 
using a revision of the Babcock Test made by the latter. Gil- 
bert selected two groups, an experimental one consisting of 175 











612 GEORGE LAWTON 


individuals of average age 63.8 and a control group of 185, 
average age 25. All subjects had clear sensory functions, spoke 
English and had no history of psychotic episodes. Illiterates 
were excluded. The groups were comparable in occupational 
status and in motivation (at least so Gilbert assumed). The 
younger and older groups were matched for vocabulary level. 
Then 33 short tests requiring timed recall of old association, 
quick perception, various types of memory, motor ability new 
learning, ete., were administered. The scores of the older 
persons on the efficiency tests were compared with those of the 
younger individuals who had received the same vocabulary 
score (i.e., were of the same intellectual level). The dis- 
crepancy of the two average scores, or ‘‘efficiency index,’’ was 
taken as that presumably between the aged person’s ability at 
senescence and as it was when he was younger. This ‘‘index’’ 
is of unusual importance in the study of the aged individual. 
To know the average score on the efficiency tests means little 
until we discover how the individual’s performance compares 
with that which is expected from one of his intellectual level. 
He might be of natively inferior intelligence but functioning 
with good efficiency or endowed with high intellect but func- 
tioning with very poor efficiency. 

Though Gilbert believes that her own study confirms the 
claim that vocabulary remains constant after maturity has been 
reached, for the sake of safety she offers her efficiency indices in 
two forms: in one table there is no correction for a possible 
improvement in vocabulary ; in the other, going O’Connor one 
better, she marks down in vocabulary all of the older groups 
by three years instead of two, though she says such a large 
correction is unnecessary. 

However, in both tables, there is shown a large and reliable 
difference in the mental efficiency of persons in the sixties and 
persons in the twenties. The efficiency index of no young per- 
son falls as low as the mean of the older group ard only two 
older persons reach the mean of the younger group. 
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There is a definite decrease in efficiency for all levels of intel- 
lect but there is a marked tendency for those of the very highest 
levels of intellect (largest vocabulary) to show relatively less 
decrease of efficiency than those of less intellect. Apparently 
the greater the intellect with which one is originally endowed, 
the less he tends to deteriorate or the longer he tends to retain 
his original equipment of efficiency. 

One important result of Gilbert’s study is to show that indi- 
viduals vary in efficiency and that age alone is an insufficient 
factor by which to judge one’s ability to meet the requirement 
of a particular vocation. This study is in line with the results 
of other similar investigations: an arbitrary retirement age of 
65 ought to be supplanted by a more flexible system. While 
some sixty year olds are unable to handle their tasks with their 
usual ability, others can pursue their work with accustomed 
efficiency, still others can work, but at a somewhat lower level. 
But even if the older worker has lost a small measure of the 
speed which was his at an earlier age, this may be compensated 
for by additional experience, judgment, ete. If an injustice 
is done by our industrial system by arbitrarily excluding per- 
sons in the sixties from labor which some might still do, what 
shall we say of the compulsory elimination from industry of 
the forty-year old worker which shelves persons still in their 
prime? 

Gilbert’s study is an important one and it is therefore worth 
time to consider its shortcomings. Apart from the fact that 
much more work must be done before the constancy of vocabu- 
lary can be accepted as proved and that the results may be 
subject to revision with regard to the correlation of mental age 
and vocabulary in the fifties and sixties, to the applicability of 
the Babcock test to normal subjects, to the selection of the 
battery for determining efficiency, the chief criticism is that 
the older and younger group are not strictly comparable. Gil- 
bert states that 76.5 per cent of the older group was unemployed 
at the time of the study, whereas only 38.9 per cent of the 
younger group were so situated, but dismisses as a problem for 
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future research ‘‘The relationship between unemployment and 
inefficiency, particularly among older persons in whom deteri- 
oration is expected.’’ 

However, certain pertinent questions must be raised. What 
effect has unemployment and the consequent lack of proper 
work habits and practice on the ability to handle the thirty- 
three tests of efficiency? What about the dissimilarity of the 
younger and older groups in motivation and emotional reaction 
to the tests? Employment or unemployment, regardless of 
age, are significant items to be matched if we want comparable 
groups. Moreover, we have considerable justification for be- 
lieving unemployment or a previous history of long or frequent 
unemployment periods is an even more serious threat to the 
morale of older persons than to that of younger ones. In this 
respect, Gilbert’s study suffers from a limitation which is true 
of psychometric work generally in the field of old age: little 
consideration is given to the extent to which emotional sets and 
personality patterns may affect test results. Perhaps it is too 
much to ask that groups be equated with respect to freedom 
from serious neurotic involvements as well as from a history 
of psychotic episodes. It is not too much to ask for specific 
information on how each young and old person reacted to the 
test situation as a whole and to the individual tests. Such 
qualitative information is of peculiar importance in the study 
of old people. 

The writer is of the opinion that subsequent research will 
show that economic and social insecurity play a large part in 
hastening mental deterioration in the forties and fifties and are 
particularly responsible for the marked increase in the deterio- 
ration rate of the late sixties over the early sixties, which Gil- 
bert herself notes but leaves for the future to explain. If psy- 
chometrice studies at any age level are of limited usefulness 
when socio-economic and personality factors are ignored, at 
senescence such isolated test results are of dubious validity. 

Adult Intelligence, A Psychological Study of Test Perform- 
ances by Weisenberg, Roe and McBride, published in 1936, 
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reports the results of a battery of language intelligence, educa- 
tional achievement and non-language tests given to a total of 
70 men and women ranging in age from 20 to 59, mean age 36.1. 
Because of the age span, this volume does not belong, strictly 
considered, in a survey of the research done in the field of 
senescence. However, the approach to the problem and the test- 
ing program, for the most part, are appropriate to older subjects 
and for that reason the study ought to be cited here. 

One very important methodological novelty of the study is the 
technique used to secure an unselected sample of the population. 
Theoretically, the best method would be to test, in house-to- 
house fashion, every individual in a given community. This is 
not generally feasible, though Jones and Conrad in ‘‘The 
Growth and Decline of Intelligence, A Study of a Homogene- 
ous Group between the Ages 10 and 60’’ (1933), made a sur- 
vey of almost a complete New England community. Instead, 
the authors of ‘‘ Adult Intelligence’’ worked in the surgical 
and orthopedic wards of three Philadelphia hospitals, making 
certain, of course, that the physical disabilities were not of a 
type to affect test performances. The hospital group besides 
furnishing a random sample of average and near-average per- 
sons, employed as well as unemployed, also provided subjects 
who were grateful for devices to help pass the time away and 
who were in nowise limited as to the amount of time that they 
could devote to examinations or interviews. The problem of 
securing a group of suitable subjects is especially acute at the 
senescent level, and the idea of a hospital group is a very 
useful one. 

Unlike most other studies of adult intelligence, the results of 
which are based on one or two short tests, or at any rate a 
relatively small number, the battery employed here was a very 
extensive one, consisting of 34 tests and involving 15-20 hours 
of testing time. Furthermore, the hospital situation permitted 
a more friendly and personal relationship and a more intimate 
knowledge of qualitative factors. 
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The conclusions of ‘‘ Adult Intelligence’’ are in accord with 
that of most other studies in the field. The authors find that 
the greatest extent of mental development is reached before the 
twenties and from this decade through the fifties there is little 
further gain and comparatively little decline. We again are 
shown that the peak of development and the course of abilities 
with age are different for different performances or abilities, 
and that there is considerable variation among individuals in 
achievement. Sex differences are negligible. Formal school- 
ing is not as important a factor in determining test results as is 
generally supposed. Interestingly enough, in working out the 
correlations of various tests with age, the only evidence of an 
increase in score with age appears in the vocabulary tests, 
notably the Thorndike Word Knowledge. (This is confirma- 
tion of the Gilbert position referred to earlier relative to the 
correction which it is necessary to make for the improvement 
in vocabulary in later maturity.) The most marked evidence 
of decrease in scores with age occurs in the analogies and the 
non-language tests. 

Dr. Irving Lorge in ‘‘The Influence of the Test upon the 
Nature of Mental Decline as a Function of Age’’ (1936), makes 
a valuable contribution to the methodology of studying senes- 
cence. In order to determine the relationship between age and 
various tests of mental ability, eleven tests were administered 
to a group of 143 adults ranging in age from 20 to over 70. 
Some of these were tests of power, others a mixture of power 
and speed. Lorge maintains that the existence of a high cor- 
relation, in some cases as high as .85 between power and power- 
speed tests of ability (i.e., between IER Intelligence Scale 
CAVD and Otis Self-Administering Test of Mental Ability 
Higher Examination, 20 minute time allowance), is not suffi- 
cient to prove that each test measures the same thing. Indeed, 
the relationship of each type of test to age differs markedly. 

The results of Lorge show that the relationship between age 
and ability varies as the test used to measure the ability. By 
matching similar scores on CAVD in three age groups (20-25, 
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27.5-37.5, over 40) with scores obtained on the other tests, he 
was able to determine the penalty which age imposed upon the 
particular measure of ability which is employed, in this case a 
mixed power-speed test. He applies his findings to the Jones 
and Conrad study already cited in this article and to the Miles 
and Miles investigation, ‘‘ The Correlation of Intelligence Scores 
and Chronological Age from Early to Late Maturity’’ (1932), 
employing in the case of the former a correction of .780 points 
per year after age 20 and in the later study a correction of .480 
points. In both instances, he finds that the adjusted means do 
not show the same gradual decline exhibited in the obtained 
scores. 

Lorge concludes that speed obscures mental power per se in 
older adults and a correction must be applied to take care of 
those physiological, educational or emotional changes that inter- 
fere with the older person’s power to cope with mental tasks. 
Such a correction may alter the curve of mental decline to a 
curve of mental plateau or even to a curve of mental growth, 
though Lorge admits that mental growth in the later age brack- 
ets may be more the result of the special selection of death 
(death coming earlier to the physically and mentally mediocre) 
than a true growth. The reported facts of mental decline as 
concomitants of age are more apparent than real and are due to 
the fact that the test used to measure mental ability is not a 
genuine measure of mental power. 

We have seen that the problem of evaluating senescence or 
deterioration cannot be met without the use of tests that have 
been standardized on adults and for which adequate norms, at 
different age levels, are available. Dr. David Wechsler’s 
‘*Bellevue Intelligence Examination,’’ to be published in the 
fall of 1938, will make available for the first time data of this 
kind, having norms for all ages up to 70. Any discussion of 
this test, however, must be postponed until its official pub- 
lication. 

In an address made in October, 1937, before the New York 
Neurological Society, Dr. Wechsler offered certain observations 
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concerning the approach to the study of senescence which are 
of considerable importance. He pointed out that ‘‘normal 
deterioration’’ is hardly different qualitatively or quantita- 
tively from the deterioration regarded as pathological by 
psychiatrists and neurologists. For deterioration, however 
defined, does not begin in senescence or senility but starts im- 
mediately after the point of maximal virility.. This point for 
most human abilities and most individuals begins somewhere in 
the twenties. Mental deterioration of the ‘‘no ”” variety, 
i.e., not caused by disease processes or injury, does not appear 
suddenly as a precipitate drop in the developmental curve. It 
is a gradual process, though as aging progresses, it may in- 
erease in momentum. This point is well brought out by 
Wechsler in Chapter 7 of his book The Range of Human Capaci- 
ties (1935). It is worth noting that Dr. Alfred E. Cohn of 
the Rockefeller Institute working in the field of physical aging, 
principally heart disease, makes exactly the same point con- 
cerning the gradual decline in the functioning of the heart 
(‘The Aging of the Heart Muscle Regarded from a General 
Biologie Point of View,’’ 1929). 

We may sum up our discussion of the mental functioning of 
old people by saying no adequate study of their mental abilities 
and performances yet exists. The first requisite for such a 
study is a satisfactory psychometric tool; a testing program 
which will compare the old person not only with his normal 
contemporaries but with his own abilities as they were when 
he was young. In this testing program due consideration must 
be given to two facts: first, different test performances reach 
their peak at different ages and show different courses through 
maturity; second, even those performances which seem to 
decline rapidly after maturity in relation to other test perform- 
ances are in reality seen to decline slowly when compared to 
the rate of development. Hence our norms must be based on 


large numbers of persons of every age level from early to late 
maturity and senescence. 
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Each type of activity must be measured by itself at sufficient 
length and with a sufficient number of cases properly selected 
to afford a reliable estimate. We still have to learn something 
of the relationship of various mental abilities at the adult level, 
to say nothing of the senescent level with all its special problems. 

But even when an adequate psychometric approach will have 
been devised, it will have to be properly applied: i.e., the part 
that emotional, social and other non-intellectual factors play in 
affecting test results must be suitably evaluated. Test results 
simply as test results are even more suspect at the senescent 
level than at earlier periods. 

Facts about the mental abilities of old people are of great 
importance in mental hygiene and in social planning. Only 
when we have reliable data here, can we make a just decision 
as to the work which old people can do, and as to which ones 
can continue working past the technical retirement age, which 
ones should enter upon a modified work program at such a time 
and finally which persons should be given simpler tasks even 
before the retirement age has been reached. Our knowledge 
today about the possibilities of such modified work programs 
for older people is very scanty and unreliable. In a planned 
society no individual is completely scrapped. Depending on 
the nature and extent of the deterioration every individual can 
perform some socially useful work no matter how elementary or 
limited it may be. 

















JUDGING EXPRESSIVE MOVEMENT: III. 
JUDGMENTS OF DOMINANCE-FEELING 
FROM PHONOGRAPH RECORDS 
OF VOICE’ 


PHILIP EISENBERG and ELEANOR ZALOWITZ 
Columbia University 


THE PROBLEM 


E have previously studied the question how accurately 
and in what ways people judge dominance-feeling 
from the expressive movements, handwriting and 

gait.?, In this paper we are continuing the investigation of the 
general problem of judging personality from expressive move- 
ment by studying judgments of dominance-feeling from phono- 
graph records of voice, another experimentally isolated ex- 
pressive movement. This investigation is of special impor- 
tance, since, in forming our impressions of our fellows from 
their expressive movements, voice plays a large role. Here 
we are not interested in the content of speech, but in the man- 
ner and qualities of speech, for example, loud or soft voice, 
fast or slow, smooth or hesitant, even or uneven tempo, etc. 
Though this may seem to imply that we analyzed isolated 
features of voice, actually we permitted our judges to form 
an impression from the entire voice. 

1 The authors are indebted to Dr. Otto Klineberg and Dr. Max Hertz- 
man for their advice and assistance. 

2 Eisenberg, P. Judging Expressive Movement: I. Judgments of Sex 
and Dominance-Feeling from Handwriting Samples of Dominant and Non- 
Dominant Men and Women. Journal of Applied Psychology, 1938, 
Vol. 22, No. 5, pp. 480-486. Eisenberg, P, and Reichline, B. B. Judging 


Expressive Movement: II. Judgments of Dominance-Feeling from Motion 
Pictures of Gait. Journal of Social Psychology in press. 
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As a secondary problem, which we have not investigated 
previously, we studied the consistency of judgment over a 
short period of time. 


PROCEDURE 


On the basis of the Maslow ‘‘Social Personality Inventory,”’ 
a rating scale of dominance-feeling (both of these being ad- 
ministered twice) and an interview, 8 extremely dominant 
and 8 extremely non-dominant Barnard College women were 
selected from a group of 238.5 We obtained a one minute rec- 
ord of each subject reading a description of a character from 
Mann’s ‘‘The Magic Mountain.’’ The subjects were well 
aware that records were being made of their voices, but in- 
stead of trying to conceal the fact, we preferred to utilize this 
impressive situation believing that it would even more serve 
to differentiate the extremes. 

The records were played for a group of judges, 43 psychol- 
ogy students at Brooklyn College,* in the following random 
order: D, ND, D, D, ND, ND, D, D, ND, D, ND, D, ND, D, 
ND, ND. After the playing of each record, the judges were 
asked to decide whether the girl was dominant or non-domi- 
nant. After an interval of 30 minutes, during which judg- 
ments were made from motion pictures,® the records were 
played in another random order, and judgments were made. 
At the end of each presentation, the judges were asked to 
write what were considered to be the characteristics of voice 
of dominant and non-dominant women. The following defi- 
nition of dominance-feeling was given to the judges: 

8 For an elaboration of the method of selecting extremes of the distri- 
bution of dominance-feeling, see Eisenberg, P. Expressive Movements 
Related to Feeling of Dominance. Arch. Psychol., 1937, No. 211. 

4 We are grateful to Dr. Austin B. Wood for his assistance in securing 
subjects. 

5 Subjects were required to judge dominance-feeling from motion pic- 
tures of facial expression, but due to.a serious error in splicing the films, 
these data could not be used. 
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‘*Feeling of dominance is defined as general self-confidence, 
and high level of self-evaluation. It includes a forcefulness of 
character, ‘strong’ personality, a feeling of being ‘on top,’ a 
feeling of mastery over others, a feeling of general capability. 
It may assume several forms in different people: despotism, 
aggressiveness, helpful leadership, guidance, or just feeling 
dominant with no active expression. . . . Low dominance-feel- 
ing, in general, consists of the reverse or absence of these feel- 
ings. It includes feelings of general inferiority, lack of self- 
confidence, uncertainty, feeling dominated by others, timidity 
and shyness, often a willingness to be self-effacing, and self- 
consciousness. ’”* 


RESULTS 
1. How well can people judge dominance-feeling from phono-. 
graph records of voice? Referring to Table I, we see that in 


TABLE I 
Average Judgments of Dominance-Feeling for Two Presentations 




















PRESENTATION N poy ‘cain c* 
Bs ae ie Nay 43 5.93 62.94 .26 
i RRR Re Read 40 6.25 60.94 22 





* Vernon’s method of computing coefficients of contingency was used, 
op. cit. 


both presentations dominance-feeling was judged a little better 
than average chance. However, we must consider this only a 
tentative conclusion until we make a further analysis. (See 
question 5.) It is interesting to compare our C’s of .26 and .22 
for both presentations with a C of .29 which Allport and Can- 
tril’ obtained when voices were matched with scores on the All- 
port A-S test. However, we cannot make a direct comparison 


¢ For an elaboration of the concept of dominance-feeling, see Maslow, 
A. H. Dominance-Feeling, Behavior and Status. Psychol. Rev., 1937, 44, 
404429, 

7 Allport, G. W. and Cantril, H. Judging Personality from the Voice. 
J. Soc. Psychol., 1934, 5, 37-55. See also, Vernon, P. E. The Matching 
Method Applied to Investigations of Personality. Paychol. Bull., 1936, 
33, 149-177. 
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since Allport and Cantril utilized different experimental con- 
ditions and another personality trait. 

2. How consistent are the judgments of voice over a period of 
time? Referring to Table I, we see that there were slightly 
more errors made in judging the second time. However, the 
difference between the average number of errors of both presen- 
tations is not statistically reliable (critical ratio .97). The 
slight increase in errors was most probably due to less attention 
because of fatigue and boredom. 

However, the correlation between both presentations for total 
number of errors was only .42+ .09. As a reliability figure 
this is exceedingly low and we might reasonably conclude from 
this that the judgments of voice were rather inconsistent. 

But because of the peculiar conditions of this experiment, it is 
necessary to analyze consistency in absolute rather than relative 
terms, that is, in terms of actual identity of response for each 
judge and each record instead of a correlation measure. When 
we make this analysis, we find that 69.43 per cent of all re- 
sponses were identical in both presentations. From this we 
may more reasonably conclude that judgments of voice are 
fairly consistent, that is, that the judgments are not made by 
chance but on the basis of some definite criteria. Just what 
those criteria are we shall see later (question 5). It is very 
doubtful that memory played any significant part in this iden- 
tity of judgment. 

3. Do women judge better than men? Comparing 13 men 
with 30 women in the first presentation, it was found that there 
was no statistically reliable difference in the average number of 
errors made between these two groups (critical ratio .89). That 
women can judge dominance-feeling from phonograph records 
of voice no better than men must remain a tentative conclusion 
in the light of the small number of cases. However we obtained 
the same result in connection with judgment of gait, and other 
experiments in judging support this conclusion.*® 

4. In what respects do the good and poor judges differ? The 
judges were required to write free descriptions of what they 

8 Eisenberg and Reichline, op. cit. Vernon, op. cit. 
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considered to be the characteristic differences between the voices 
of dominant and non-dominant women. In making a qualita- 
tive comparison of the extremely good and extremely poor 
judges, we arrive at the following tentative conclusion: The 
good judges tend to give a fuller, more detailed description than 
the poor judges. Their descriptions are more objective and 
directed to the subtler aspects of expression. However, these 
differences are very slight and must be considered with caution, 
due to the small number of cases. 

5. In judging indwidual records, do the judges reveal stereo- 
typed attitudes? We have answered only partially the ques- 
tion of how well individuals judge dominance-feeling from 
phonograph records of voice, and in what ways they arrive at 
their judgments. An analysis of individual records may throw 
some further light on these questions. 

When we correlate the judgments of individual records for 
both presentations we obtain a correlation of .75 + .08, indicat- 
ing that though individuals may differ to some extent when 
making judgments from time to time, the group as a whole is 
making about the same judgments for individual records. This 
suggests that there is something about the record and the stereo- 
typed attitude the judges bring with them that influences their 
judgment to uniformity. 

Table II indicates that there is a wide range in the ease of 
judging any record. We have combined all the judgments for 
each record for both presentations and computed the reliability 
of the difference from chance, whether the judgments were cor- 
rect or incorrect. (This combination is justified by the correla- 
tion of .75 above.) It was found that records 1, 2, 4, 10, 13, 14, 
15 and 16 were judged reliably better than chance; records 3, 5 
and 11 reliably worse than chance; and the other five records 
just about chance, using a critical ratio of 2.50 as a dividing 
line. 

When we examine the records themselves, we find that the 
judges are influenced by the records themselves and their pre- 
conceptions rather than by the actual personalities of the women 
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TABLDE II 
Reliability of Judgment of Individual Records 





No. PER CENT D FROM 
ERRORS CORRECT* 50% @ 
24 71.08 21.08 

2 97.59 47.59 

60 27.71 22.29 
23 72.29 22.29 
70 15.66 24.34 
39 53.01 3.01 
30 62.86 12.86 
56 32.53 17.47 
46 44.58 5.42 

8 90.36 40.36 

68 18.07 31.93 
27 67.47 17.47 

2 97.59 47.59 

24 71.08 21.08 
16 80.72 30.72 

3 96.39 46.39 























* Of 83 possible judgments. 


judged. We find in the eight records judged better than chance 
that four are dominants and four non-dominants. The four 
dominants happen to have voices which fit the preconception of 
dominant voice, that is, their speech is dramatic, well-modu- 
lated, non-hesitant, in general, ‘‘good’’ speech, and also possess- 
ing mature qualities of voice, not shrill and high-pitched. The 
non-dominants happen to have the opposite qualities of voice. 
Of the three judged reliably worse than chance, No. 3, a domi- 
nant, has all the stereotype characteristics of a non-dominant, 
and No. 5 and 11, non-dominants, the characteristics of domi- 
nant voice. Those judged just about chance have some domi- 
nant and some non-dominant characteristics so that the subject 
will be called a dominant or non-dominant depending on what 
the judges feel is most important at the moment of judging. 
For example, No. 7, a dominant, has all the characteristics of 
good speech, but her voice is immature, even childish and 
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shrill ; whereas No. 12, also a dominant, has a mature and deep 
voice, but she does not read well, since she hesitates and falters 
frequently. 

It seems then that the judges did not really know what was 
the characteristic expression of a dominant girl as distinguished 
from a non-dominant girl, and were judging by the most obvious 
characteristics, expressing their stereotyped attitudes largely 
on whether the speech was ‘‘good’’ or ‘‘bad’’ speech, and on 
the maturity of the voice, these being synonymous for them 
with dominant and non-dominant speech. It is also probable 
that in enumerating the characteristics of the dominant as 
opposed to the non-dominant, they were expressing this stereo- 
typed attitude rather than the subtleties of expression. This 
tends to confirm our suspicion that the good judges were those 
who were more attentive to these subtle aspects of expression, 
in spite of the fact that they could not verbalize any better than 
the poor judges. 

This analysis brings us to the conclusion that we obtained 
judgments a little better than chance merely because of a few 
records ‘‘easy’’ to judge. And we find in Table III that when 


TABLE Ill 
Percentage of Incorrect Judgments After Eliminating the Records 
** Fasiest’’ to Judge 











PER CENT INCORRECT 
NO. OF RECORDS 
ELIMINATED Pamneqeetens Preseqeetion 
_ fy RS BAR SSE a 37.21 39.06 
5 Tae Renee 39.68 41.67 
i scdhbaien> 42.36 44.46 
Be ceecubianaiens 45.26 47.69 
— eee 48.26 51.04 











we eliminate one by one the four ‘‘easiest’’ records to judge, 
we approach chance judgments very rapidly, considering 50 per 
cent as chance. It might also be added, that if the judged 
groups were matched for reading ability and maturity of voice, 
the judges would have made more nearly chance judgments. 
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6. What is the relationship between social agreement and cor- 
rectness of judgment? When we correlate the number of errors 
with the number of identical judgments made for individual 
judges we obtain a correlation of — .09 + .11 for the first presen- 
tation and .38 + .09 for the second. In the light of the change 
of sign and the size of the P.E.’s we may consider this a zero 
relationship. 

However, when we correlate the percentage of correct judg- 
ments made on individual records with the percentage of iden- 
tical judgments combined for both presentations we obtain a 
correlation of .53 + .12. The percentage of identical judg- 
ments for individual records can be considered a measurement 
of social agreement since the group as a whole is making about 
the same judgments and revealing that they are basing their 
judgments on the same preconceptions. If that is so, we may 
conclude from the above correlation that social agreement is to 
some extent a measure of accuracy of judgment. But when we 
examine Table IV we find that the records judged reliably better 


TABLE IV 


Relation Between Reliability of Judgment of Records and Identity 
of Judgment Over a Period of Time 





NO. OF AVERAGE % 
RELIABILITY OF JUDGMENT RECORDS IDENTITY 


Better than chance .................. 8 78.71 


5 52.97 
3 72.07 

















than chance and the records judged reliably worse than chance 
have the highest percentage of identical judgments, whereas 
the records judged about chance have the lowest percentage of 
identical judgments. If there were an equal number of records 
judged better than chance and worse than chance, then the 
above correlation would probably have been zero. 

These results agree with the findings of Allport and Cantril,° 

® Allport and Cantril, op. cit. 
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Taylor’® and Stagner™* who had subjects judge various aspects 
of personality from voice. They conclude that the degree of 
social agreement is greater than the degree of correctness of 
judgment, and that this is due to the fact that judgments are 
based on a preconceived stereotyped attitude. Taylor states: 
‘‘There is a tendency for auditors to be most consistent (with 
each other) in their judgments when they agree least with the 
subject’s self-ratings. It is at least clear that their degree of 
consistency in judging a given trait bears no positive relation- 
ship to their accuracy in judging that trait.’’ 


DISCUSSION 


We have found that people judge dominance-feeling from 
phonograph records of voice, just as in the case of judgment 
of handwriting and gait, a little better than chance, but when 
we analyze individual records we find that they make their judg- 
ments on the basis of preconceived notions which may. or may 
not correspond to reality. It is not necessary to repeat here 
our discussion of judgment of gait, but all our remarks are 
relevant to the judgment of voice. 

It is interesting, in summary, to compare the judgments of 
handwriting, gait and voice. All judgments were made of 
extremely dominant and non-dominant women, and though 
different groups of judges were used the following percentages 
of correct judgments were obtained: handwriting 64.00 per 
cent, gait 63.89 per cent, and voice 62.94 per cent. The similar- 
ity in percentages is rather striking. If the similarity is not 
coincidental, it can be interpreted to mean (1) that the judges, 
using the same preconception of dominance, arrive at approxi- 
mately the same decisions regardless of the expressive move- 
ment judged, and (2) that there is some element of reality in 
this stereotype but not enough for prediction or to be able to 


10 Taylor, H. 8. Social Agreement on Personality Traits as Judged 
from Speech. J. Soc. Psychol., 1934, 5, 244-248. 

11 Stagner, R. Judgments of Voice and Personality. J. Hd. Psychol., 
1936, 27, 272-277. 
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use the matching method as a technique for measuring the inter- 
relationship between various aspects of personality. 

These interpretations seem plausible, since we, as well as other 
investigators, have found a high degree of social agreement in 
judgment of expressive movement. That social agreement 
bears no relation to correctness of judgment is an important 
finding and should disturb the faith of those who believe that 
if we have enough judges who agree we can learn the truth. 
This is nothing more than another illustration of Sherif’s*? 
thesis that any group of people living under the same cultural 
conditions will develop social norms, not only of conduct, but 
of perception and judgment, which will tend to make these psy- 
chological processes uniform, but not necessarily realistic. 

This conclusion becomes even more plausible when we find 
what this preconception or social norm is. In a previous study*® 
it was found that almost all subjects, both men and women, con- 
sider dominance desirable and non-dominance undesirable. 
With this as a major clue, we find that our material illustrates 
a principle of judgment discovered by Asch, Block and Hertz- 
man™ in connection with judgments of photographs, profes- 
sions, political figures and political slogans, namely that ‘‘the 
judgments of a single situation are related to each other by a 
person in accordance with an underlying attitude of acceptance 
or rejection.’’ This is perhaps most clearly expressed in the 
judgment of voice where ‘‘good’’ speech and mature voice was 
considered dominant speech and ‘‘poor’’ speech and immature 
voice, non-dominant. In the judgment of gait, a smooth gait, 
with an even tempo, associated with an attractive-looking girl, 
obviously desirable characteristics, was considered to be domi- 
nant gait. Palmer-method handwriting was considered to be 
non-dominant handwriting, but Palmer writing is definitely 


12 Sherif, M. The Psychology of Social Norms. Harper, 1936. 

18 Eisenberg, P. Arch. Psychol., op. cit. 

14 Asch, 8. E., Block, H. and Hertzman, M. Studies in the Principles of 
Judgments and Attitudes: I. Two Basic Principles of Judgment. J. 
Psychol., 1938, 5, 219-251. 
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undesirable to our subjects, particularly if one recalls the tor- 
tures connected with it in elementary school. Desirable hand- 
writing is distinguished, irregular and unusual, and that was 
considered to be dominant handwriting. In general, since there 
is a general underlying attitude of acceptance of dominance and 
rejection of non-dominance in the judgment of dominance- 
feeling from various expressive movements, all desirable quali- 
ties are ascribed to the dominant individual and undesirable 
qualities to the non-dominant individual. 

In the light of all these data, one would hesitate to make judg- 
ments of personality from expressive movements alone. Yet 
the personnel manager or the employer often hires or rejects 
an applicant on just such flimsy evidence, by such impressions 
that teachers are found pleasing or displeasing by their stu- 
dents, ete. When we realize that these judgments are based 
on stereotype preconceptions and that the desires of the indi- 
vidual are projected into his judgment, we can well realize all 
the errors that are made. This does not mean that expressive 
movements cannot be used in describing the personality of an 
individual. Relationships have been found, and it may prove 
to be very useful in the study of behavior mechanisms, but 
more objective methods of measurement must be used and 
subtler analyses made before we can come to any conclusions. 
We cannot rely on the process of judgment alone, not even on 
the judgment of many people. 


SUMMARY AND CONCLUSIONS 


Phonograph records of the voices of 8 extremely dominant 
and 8 extremely non-dominant girls were played for a group 
of 43 college students to judge for dominance-feeling. After 
an interval of 30 minutes, the records were played again in a 
new order. It was found that: 

1. Dominance-feeling was judged a little better than chance. 

2. The judgments were found to be fairly consistent over a 
short period of time, for though the correlation between both 
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presentations for total number of errors was only .42, 69.43 per 
cent of the judgments were identical for both presentations. 

3. Women did not judge any more accurately than men. 

4. There was a slight tendency for good judges to make a 
more detailed description of voice, directed toward the subtler 
aspects of expression. — 

5. An analysis of individual records revealed that: (a) For 
both presentations, the group made approximately the same 
errors on individual records. (b) Wide variations in ease of 
judging records were found, 8 records being judged reliably 
better than chance, 5 about chance, and 3 reliably worse than 
chance. (c) When we analyze individual records we find that 
judgments were made chiefly on the basis of preconceived 
notions of expressive behavior, and that the good judges were 
more aware of the subtler aspects of expression. (d) If the 
three or four records ‘‘easiest’’ to judge are eliminated, we 
obtain judgments no better than chance. 

6. There is no relation between social agreement and correct- 
ness of judgment. 

7. A comparison of judgments on handwriting, gait and voice 
indicates that judgments are organized on the basis of a general 
attitude of acceptance of dominance and rejection of non-domi- 
nance, with a concomitant ascription of desirable qualities to 
the dominant individual and undesirable qualities to the non- 
dominant individual. 
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PREDICTION OF ACHIEVEMENT IN FOREIGN 
LANGUAGES 


MAY V. SEAGOE 
University of California at Los Angeles 


REDICTION of success in foreign languages is of major 
importance for two reasons. First there is the high pro- 
portion of failures that usually occurs with its waste of 

students’ time, teachers’ energy, and taxpayers’ money. Sec- | 
ond, more and more colleges are willing to waive the language 
requirement for students who have proved themselves in other 
fields. Language is no longer inevitable: it can be avoided by 
those for whom it has little value and who show little aptitude. 

Many studies have been made in the field, most of them ably 
summarized by Kaulfers.. The median coefficients of correla- 
tion with bearing on the present study are .60 with prognosis 
tests, .49 with achievement in algebra, .46 with achievement in 
English, .385 with achievement in reading, .356 with intelligence 
tests, .164 with achievement in arithmetic, and —.24 with 
chronological age. The great range of coefficients obtained by 
various investigators, sometimes varying from low negative to 
nearly perfect positive correlation for a single characteristic, is 
noteworthy. 


PROBLEM 


The data available promised to shed some additional light 
on the subject, mainly because the method of administering 
tests more adequately represented the usual junior high school 
procedure than do most studies; guidance for ninth grade was 
based largely on test data gathered at entrance to the seventh 
grade. Some of the specific questions suggested were: (1) Is 

1 Kaulfers, Walter Vincent. ‘‘ Present status of prognosis in foreign 
language.’’ School Review, 1931, 39, 585-596. 
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the Luria-Orleans Prognostic Modern Languages Test superior 
to such general intelligence tests as Terman, Otis and Kuhl- 
mann-Anderson in predicting success in languages? (2) Is 
the New Stanford Reading Test superior to a general intelli- 
gence test for this purpose? (3) Is it possible to establish 
fairly definite critical scores differentiating potential failure 
from satisfactory and satisfactory from recommended grades? 


SUBJECTS, MATERIALS, AND PROCEDURE 


The subjects were 120 students, 64 boys and 56 girls, whose 
records were being followed in connection with the Carnegie 
Guidance Experiment in Pasadena, California.? In this proj- 
ect all students entering 7-1 in September, 1934, were desig- 
nated as the experimental group, a group which was followed 
for three years with a more than normally intensive guidance 
service. 

The tests used in this study include the Terman Group Test 
given in May, 1934, just before seventh grade enrollment, the 
Kuhlmann-Anderson Intelligence Test given in May, 1936, and 
the remainder drawn from the general testing program carried 
on during the fall of 1934. Tests from the general program 
referred to in this article include the Otis Intermediate Exami- 
nation, the New Stanford Achievement Test (sections on read- 
ing, physiology and arithmetic), the Luria-Orleans Modern 
Languages Prognosis Test, the Stenquist Mechanical Aptitude 
Test, and the Orleans Algebra Prognosis Test. The mathe- 
matics tests were included to determine their comparative rela- 
tionship to or independence of the foreign language prediction, 
and the physiology and mechanical aptitude tests were included 
to explore the relation of foreign language achievement to such 
theoretically unrelated factors as scientific and practical ability. 

Following the administration of each test the results were 
reported to the individual concerned and their significance for 
his educational and vocational plans was discussed. In case 


2The Experiment was directed by Dean Grayson N. Kefauver of the 
School of Education at Stanford University. 
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the individual planned to take a language in the ninth year or 
was uncertain whether he should elect it, he was advised to 
enroll in the 8-2 Exploratory Languages course which gave six 
weeks each to Latin, Spanish, and French. At the end of the 
semester he made the decision concerning ninth grade language 
on the basis of his knowledge of his own abilities based on both 
experience and standardized test data, and on his educational 
plans. The choices were reviewed by the counselor on the same 
bases, and individuals whose choices seemed ill-advised were 
interviewed. The advice was based on the rule-of-thumb as- 
sumption that the result of the intelligence tests should indicate 
an IQ of 110 or higher, that the Luria-Orleans result should 
place the individual in the upper half of his class, and that the 
accomplishment in the exploratory languages course should be 
at least normal if the student is to succeed in a language. Such 
broad limits seemed justified in a school where approximately 
forty per cent of the pupils enter college. Exceptions were 
made where the pupil’s home background, reputation as a stu- 
dent, apparent special aptitude in the field, or fixed vocational 
or educational objectives seemed to demand. 

Teachers’ marks at the completion of the first semester’s work 
in the three languages constituted the criterion of suecess. They 
were stated only as ‘‘ Recommended’’ or ‘‘Satisfactory’’ for the 
student’s information in planning for tenth grade work since 
a personal letter took the place of the grade-report and the only 
official record was one of granting or refusing credit. 


RESULTS 


Derived rather than primary scores have been used because 
of their availability in intelligence and achievement tests. The 
result would theoretically be to underestimate relationships and 
critical ratios. The use of the intelligence quotient rather than 
mental age would be questionable under usual school conditions, 
but in this case a non-failure program has so reduced the differ- 
ences in chronological age that the IQ is proportional to the 
mental age. 
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Intereorrelations between the tests analyzed have been pre- 
sented in a recent article by the same author. The most signifi- 
cant fact is that the highest degree of relationship is among the 
intelligence tests, the next between intelligence and the verbal 
group, next between intelligence and the mathematics and 
physiology group, and that there is no relation between intelli- 
gence and mechanical aptitude. The r is .63 + .03 between 
reading achievement and language prognosis, .73 + .03 between 
Luria-Orleans and Terman, .71 + .03 between Luria-Orleans 
and Otis, and .68 + .03 between Luria-Orleans and Kuhlmann- 
Anderson. It is evident that there is an unreliable difference 


TABLE I 
Description of Groups 























COMPARISON IN SUB-GROUP MEAN — ae RANGE 
Recommended | 167.2 5.0 0.8 155-178 
Chronological Age | Satisfactory 168.9 4.8 1.3 159-175 
Not Elected 172.4 6.7 0.8 159-197 
Recommended | 122.4 | 13.3 2.3 100-149 
Terman IQ ~ | Satisfactory 111.1 6.7 1.8 100-122 
Not Elected 104.2 10.5 1.3 69-126 
Recommended | 127.6 | 143 2.4 97-153 
Otis IQ Satisfactory 114.6 11,7 3.2 95-137 
Not Elected 101.8 | 13.6 1.6 74-130 
Recommended | 122.4 | 14.9 2.5 94-154 
ao Satisfactory | 1104 | 88 24 | 95-127 
Not Elected 103.5 9.5 1.1 82-127 
Stanford Reading | Eetistactory | 84 | 10 | 03 | 63-09 
Grade Placement af ; ~ , oe 
Not Elected 7.2 14 0.2 4.4—10+ 
Luria-Orleans Recommended 90.3 | 33.4 5.7 32-167 
Modern Languages | Satisfactory 87.1 | 18.2 4.9 54-121 
Prognosis Seore Not Elected 50.0 | 24.3 2.9 10-109 
Stanford Physiol- Recommended 8.2 1.5 0.3 5.2-10+ 
ogy Satisfactory 7.4 1.3 0.4 5.9-10+ 
Grade Placement Not Elected 6.4 1.4 0.2 3.2-10+ 
Stencuist Mechani- | Recommended 56.8 | 10.7 1.8 32-82 
eal Satisfactory 54.6 9.6 2.6 40-71 
Aptitude Score Not Elected 58.8 11.9 1.4 28-78 
Stanford Arith- Recommended 8.3 1.3 0.2 6.0-10+ 
metic Satisfactory 7.3 0.7 0.2 6.3-8.7 
Grade Placement Not Elected 6.7 1.2 0.1 4.3—10+ 
Recommended 78.8 | 33.4 5.7 33-163 
Pricans Algebra =| Satisfactory | 57.9 | 266 | 7.2 | 35-126 
“ Not Elected 41.7 | 16.6 2.0 | 14-84 
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between the three intelligence tests used, and the reading grade 
placement is a close second as a basis of prediction. 

Table I presents a detailed description of the test data segre- 
gated according to record in language. The ‘‘Recommended”’ 
group consists of the 34 students who elected a language and 
did work of superior quality, usually defined as A or B. The 
‘*Satisfactory’’ group consists of the 14 students who elected a 
language and did work of passing but not recommending qual- 
ity, usually defined as C or D. The ‘‘ Not Elected”’ is a hetero- 
geneous group consisting of the 72 students who did not take a 
foreign language, either because it was unnecessary in their edu- 
cational and vocational plans or because they showed no apti- 
tude for it. Since there was only one failure, insufficient to - 
establish any generalizations, that case was dropped from the 
tally. 

There seems to be some systematic differentiation for all eri- 
teria except Stenquist Mechanical Aptitude score. The extent 
to which the differences are reliable is presented in Table II. 
The letter preceding the number indicates the superior group, 
and the probability column shows the chances in 100 that the 
obtained difference is a true difference. 

In differentiating the Recommended from the Satisfactory 
group, the reliable tests in order of decreasing excellence are 
Terman, Kuhlmann-Anderson, Stanford Arithmetic, Otis and 
Stanford Reading. The Luria-Orleans Language Prognosis is 
the lowest on the list, perhaps because it claims to predict only 
for modern languages while data from Latin are included, thus 
decreasing the validity. The difference between the Recom- 
mended and Not Elected groups is greater, the reliable tests in 
order being Otis, Stanford Reading, Terman, Kuhlmann-Ander- 
son, Luria-Orleans Language Prognosis, Orleans Algebra Prog- 
nosis, Stanford Arithmetic, and Chronological Age. Only the 
Stenquist fails to show a significant difference. The difference 
between the Satisfactory and Not Elected groups is reliably 
shown by the Luria-Orleans Language Prognosis, Stanford 
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Reading, Otis, Terman, Kuhlmann-Anderson, and Stanford 

In terms of the questions proposed, a general intelligence 
test was superior to the Luria-Orleans in two out of three dis- 
criminations and inferior in one. The same thing is true of 
the Stanford Reading Test. The best generalization seems 
that the three types of tests are equaliy valid, but the Luria- 
Orleans is somewhat more variable in result. 

Though these data establish reliable differences between total 
groups, Table I also showed the great overlapping. Table III 























TABLE III 
Critical Scores 
nn PROBABLE 
TEST OR TRAIT NON-ELECTION | UNCERTAIN | RECOMMEN 
Chronological Age ccc 179-197 159-178 155-158 
*Terman IQ 69-99 100-126 127-149 
*Otis IQ 74-94 95-137 138-153 
*Kuhlmann-Anderson IQ .......... 82-93 94-127 128-154 
“Stanford Reading G.P. ........ ; 4.4-6.2 6.3—10+ ? 
*Luria-Orleans Language ........ 10-31 32-121 122-167 
Stanford Physiology G.P. ........ 3.2-5.1 5.2-10+ ’ 
Stenquist Mechanical Apt. ..... 28-31 32-78 79-82 
*Stanford Arithmetic G.P. ..... 4.3-5.9 6.0—10+ t 
Orleans Algebra Score .............. 14-32 33-126 127-163 





shows for the various tests the limits below which non-election 
seems advisable and above which superior accomplishment 
seems a certainty. The critical points are based on the ranges 
given in Table I. Tests marked by asterisks are those which, 
on the basis of Table II, seem of greatest validity in predicting 
achievement in foreign languages. 

The major impression from such data is one of the great 
extent of overlapping between groups. It may be traced to 
the heterogeneity of the Non-Electing group, to the unreliability 
of the grades used as a criterion, or most probably to the mass 
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of factors other than innate ability which affect success such as 
personality, study habits, degree of motivation and the like. At 
any rate, any attempt to establish definite limits below which 
failure seems certain and above which success seems assured 
must be treated warily. 

Because of the fact that most comparative data is in terms 
of coefficients of correlation rather than ratios it seemed desir- 
able to convert the data into that form for comparative pur- 
poses. The results in terms of biserial r are presented in 
Table IV. 


TABLE IV 
Correlation with Course Records 

















> 2 > 
4 4 a8 
ce ‘ EE 
RELATION IN er 25 = 
a2 aa ae 
& & 
es as PE 
Chronological Age .......cccccccmn — .205 + .125 | -.473+.070 | —.294+.103 
Terman IQ 532 + .102 -775 + .046 372 + .099 
Otis IQ 529 + .102 859 + .036 504 + .091 
Kuhlmann-Anderson IQ .......... 499 + .104 794 + 0.44 395 + .098 
Stanford Reading G.P. ........... 417 + .112 799 + .043 476 + .093 
Luria-Orleans Language ........... 065 + .129 -735 + .049 -760 + .067 
Stanford Physiology G.P. .... 321+ .119 645 + .057 396 + .098 
Stenquist Mechanical ............. 127 + .127 | —.105+.084 | —.201+.107 
Stanford Arithmetic G.P. ....... 502 + .104 .696 + .053 278 + .104 
Orleans Algebra Score ............ .366 + 116 .776 + 046 .461 + .094 














The interpretation as far as the relative merits of the various 
tests are concerned is the same as for Table II. In comparison 
with Kaulfers’ summary, the median language prognosis is .73 
or somewhat higher, the median of .46 for promise in algebra 
is very near his median for achievement in algebra, the median 
of .49 for achievement in reading is slightly higher, the median 
of .53 for intelligence tests is definitely higher, as is also the 
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median of .50 for achievement in arithmetic, while the median 
of — .29 for age is approximately the same. In general, it seems 
the relationships found are somewhat closer than those cited by 
Kaulfers, possibly a function of the narrower range of measur- 
ing instruments represented. 


CONCLUSIONS 


The following conclusions seem justified. 

1. The Terman, Kuhlmann-Anderson and Otis Intelligence 
tests, the New Stanford Reading Test, the New Stanford Arith- 
metic Test, and the Luria-Orleans Modern Languages Prognosis 
test accurately differentiate degree of success in beginning for- 
eign languages. 

2. It is questionable whether the Orleans Languages Prog- 
nosis Test is superior to a general intelligence test in predictive 
value. The data are contradictory. 

3. Reading achievement, though a reliable index of probable 
language record, is less valid than either general intelligence or 
the language prognosis test used. 

4. Critical scores are of doubtful value because of the great 
extent of overlapping between groups. 

5. The independence of the Stenquist Mechanical Aptitude 
Test is clearly demonstrated. 

6. The total test picture modified by subjective estimates of 
personal factors is a better estimate of probable success than 
any single test. 











A STATISTICAL ANALYSIS OF THE HUMM- 
WADSWORTH TEMPERAMENT SCALE* 


BARBARA L,. KRUGER 
Columbia University 


PURPOSE OF THE STUDY 


HE purpose of this study is to discover through statis- 
tical analysis the interrelationships which exist among 
the components of temperament as measured by the 

Humm-Wadsworth Temperament Scale. Defining a compo- 
nent as ‘‘a group of traits’’ (4) the authors devised this test 
to determine an individual’s temperament type on the basis of 
the relative strength of seven components which they have 
called normal, manic, depressed, autistic, paranoid, hysteroid, 
and epileptoid. 

The field of personality testing is attracting the attention of 
laymen and professional psychologists, alike. There are many 
personality tests on the market, but few have been subjected 
to careful objective study and attempts at validation on the 
basis of recognized criteria. 

We selected for study the Humm-Wadsworth Temperament 
Seale because it has been adopted by personnel directors in 
different parts of the country and is being used in large con- 
cerns as one of a battery of tests used in the selection of em- 
ployees. So far the only literature on the scale is that written 
by the authors (1, 2, 4, 5, 6,7). For this reason it seemed to 
offer a fertile field for research. 

Since the scale purports to measure seven components of 
temperament which are distinguishable, each from the others, 
by characteristic attitudes and feelings, the first step in study- 


1 This study was made at the University of Southern California under 
the direction of Dr. Floyd Ruch and Dr. Donald Watt in the fall of 1937. 
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ing it seemed to be a statistical analysis of its validity in this 
respect. In other words, does the scale measure seven more or 
less distinct patterns or do two or more components overlap 


to such a degree that they are but two aspects of the same 
syndrome? 


A DESCRIPTION OF THE HUMM-WADSWORTH TEST 


Theory on which the test is based. As has been already 
stated, a component of temperament is regarded as a group of 
traits (4). According to Rosanoff, on whose theory of per- 
sonality the scale is based, every individual possesses these 
components in some degree. Persons found in institutions for 
the insane and for criminals differ from normal or socially ad- 
justed individuals in degree rather than in kind; that is, their 
manifestations of some predominant component of tempera- 
ment have been uncontrolled and have functioned as liabilities 
rather than as assets. 

The strength of the so-called ‘‘NorMaL’’ COMPONENT is es- 
pecially important in the matter of adjustment to the demands 
of everyday life and in moments of stress and during periods 


of strain. The authors say ‘‘The ‘Normal’ Component is pri- © 


marily a control mechanism, providing rational balance and 
temperamental equilibrium. It underlies the conservatism and 
conformity to socially acceptable conduct observed in the well 
adjusted subject.’’ 

The other components derive their names from psycho- 
pathology, and their traits are characteristic of the respective 
mental disorders. 

The HysTrerom CoMPoNENT, besides being associated with 
hysteria is identified with ‘‘criminalism.’’ It is concerned es- 
sentially with self-preservation. It would appear that the 
authors were introducing an extraneous criterion when in de- 
scribing this component they say, ‘‘An individual with an 
excess of this component possesses a character defect with 
ethically inferior motivation, manifested by malingering, steal- 
ing, lying, cheating, and similar anti-social behavior.’’ From 
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the psychological standpoint, stealing, lying, and cheating are 
phenomena which are observed and dealt with as are temper 
tantrums and other devices employed by the individual 
attempting to control forces in his environment. 

The CyLom CoMPONENT, associated with manic depressive 
psychoses and involutional melancholia, is considered in its 
two phases, the manic and the depressed. Both are ‘‘charac- 
terized by emotionality, fluctuations in activity and interfer- 
ence with voluntary attention.’’ In scoring the answers to 
questions in the scale each phase is considered as a separate 
component. 

The Scuizoi CoMPONENT, ‘‘characterized by heightened 
imagination,’’ has two characteristic manifestations: autism 
and paranoia, each of which is regarded as a separate compo- 
nent of temperament. 

The EpiLepTow CoMPONENT, which is associated with physi- 
cal symptoms of epilepsy, latent if not manifest, is character- 
ized by meticulous application to projects and by inspirations, 
often of a religious nature. 

Development of the scale and the selection of questions. 
Accepting Rosanoff’s theory of temperament, Dr. D. G. 
Humm, a consulting psychologist, and Mr. G. W. Wadsworth, 
Jr., a personnel manager, both of Los Angeles, California, de- 
veloped this scale by making a statistical comparison of the 
answers made by normal and abnormal, or atypical, subjects 
to questions devised to detect the presence of the respective 
components. 

The authors’ detailed description (5) of how the questions 
were devised and selected is most interesting, not only in con- 
nection with this particular scale, but because of the light it 
throws on the matter of discovering significant questions for 
inclusion in any questionnaire of this sort. We will only state 
briefly the most salient points. 

In order to determine which questions were discriminatory 
with regard to the presence or absence of a given component, 
Humm and Wadsworth tried them out on patients in mental 
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hospitals who were selected on the basis of their clinical rec- 
ords. ‘‘Those whose case studies furnished no evidence of 
marked possession of a given component were contrasted with 
subjects in whom the component was strongly manifested.’’ 
An attempt was made to use only those suffering from un- 
mixed psychoses. Inmates of the state prison were used as the 
group whose answers were significant with regard to the pres- 
ence of the hysteroid component. 

In order to determine which questions were diagnostic for 
the ‘‘Normal’’ component, individuals who had been com- 
mitted to institutions were considered ‘‘abnormal’’ and their 
answers contrasted with those of persons who were employed 
and otherwise adjusted to everyday pursuits and whose case 
histories revealed no evidence of psychopathic tendencies (the 
normal group). 

The function of the ‘‘no’’ count. Because so many subjects 
were used who were unwilling or unable to cooperate erratic 
responses were sometimes observed. These took the form in 
some instances of stereotyped or patterned responses, such as 
answering all questions, ‘‘yes,’’ or all, ‘‘no,’’ or of a given 
number, ‘‘yes,’’ then a given number, ‘‘no.’’ Among crimi- 
nals, they took the form of leaving many significant questions 
unanswered. In order to ascertain whether the test had been 
validly taken the number of questions answered ‘‘no’’ were 
counted for each subject, and the authors considered as ‘‘ valid 
subjects’’ only those whose ‘‘no’’ count fell between 138 and 
196.? 

Validity of the test. ‘‘The validity of the test was deter- 
mined by a comparison of the test scores with facts in the case 
studies of the subjects, and was further determined by use of 
the test on subjects who had not been used in establishing the 
norms’”’ (5). 

2 As the test is now administered a provision is made for giving such a 
subject a ‘‘second chance’’ by having him circle in red all those answers 


about which he is doubtful. These are then scored as ‘*yes’’ if the count 
exceeds 196 and ‘‘no’’ if it is less than 138. 
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Reliability of the test. ‘‘ Reliability was tested by the split- 
half method because it is known that subjects, especially of 
eycloid temperament are likely to be characterized by changes 
of mood; so that measurement at different times would intro- 
duce an unwarranted error’’ (6). 

The reliability of the respective components ranges from 
.70 + .03 for Schizoid Paranoid to .88 + .01 for both ‘‘Cycloid 
Depressed’’ and ‘‘Schizoid Autistic.’’ 

Other considerations. We shall not go into the fine points 
of test administration, scoring, and the interpretation of scores 
because these are practical matters with which we are not con- 
cerned in this study. Our interest is in whether seven discrete 


components are discernible in a statistical analysis of test 
results. 


DATA AND METHOD OF THIS STUDY 


Sources of data. MHollerith cards,’ in which component 
scores were already punched,* were secured for 437 males who 
had consulted the Adult Guidance Service, which is super- 
vised by the Adult Education Section of the Los Angeles City 
Schools. This service is free and is available to the students 
attending the Adult Schools and to the adult public in gen- 
eral. Counseling assistance is given in ‘‘vocational, educa- 
tional, business, personal, and health problems,’’ according to 
a mimeographed announcement. 

The group which furnishes our data, therefore, cannot be 
considered a cross section of the adult male population, but 
rather as representative of those many persons who for one 
reason or another have not found a satisfactory vocational ad- 
justment. The majority are between 20 and 30 years old. We 
should expect their scores to differ in some important respects 
from those of a more heterogeneous group. 


3 For a description of the Hollerith machine and its equipment see: 
Practical Applications of the Punched Card Method in Colleges and Uni- 
versities, New York, Columbia University Press, 1935. 

4I wish to acknowledge my indebtedness to Mr. Robert C. Storement of 
the Guidance Service for supplying those cards and to the Director for 
permitting me to use them. 
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Treatment of data. Intercorrelations were worked out for 
each component with every other and with the ‘‘no’’ count. 


RESULTS 


Our findings are presented in Table 1.5 They show a con- 
sistent negative relationship between the ‘‘Normal’’ compo- 
nent and each of the other six and a positive relation to the 
‘*no’’ count. Just as consistent is the negative relation of the 
latter to the scores for the components other than the 
‘*Normal.’’ 

The Epileptoid scores show only negligible correlations with 
all of the other component scores. 

The Hysteroid scores show little relationship to these for 
other components, with the exception of the Paranoid. 

The correlation between the Manic and Depressed compo- 
nents, the two phases of the Cycloid temperament, is very high 
(.87). In contrast to this is the low correlation of .30 between 
the Autistic and Paranoid components which are considered 
two phases of Schizophrenia. 

The coefficient of .68 indicates a considerable degree of rela- 
tionship between the Autistic and Depressed components. 

A comparison of our findings with those presented by the 
authors of the test* and which are reproduced in Table II 
shows some striking resemblances. 

The striking differences lie in our negative correlations of 
the normal with each of the other six components and in the 
finding of a considerable degree of relationship between the 
Paranoid and Depressed components and practically none be- 
tween the Paranoid and Epileptoid components. 


5 We have not corrected the coefficients of correlation for attenuation 
because we wished them.to be comparable to those published by the authors 
of the test. It should be kept in mind they would all be higher if so 
corrected. 

6 Besides these zero order correlations the authors of the test worked out 
partial correlations which, however, did not alter the relationships in most 
eases enough to suggest the advisability of our doing likewise. 
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TABLE I 
Intercorrelations (Zero Order) of the Components of the Humm-Wads- 
worth Test Based on the Answers of 487 Males Who Consulted the 
Adult Counseling Service of Los Angeles 











2 S » =| 
COMPONENT 3 = v és 5 MY E 
g Bm Bee aa 5 
z% x a a < ry rm] 
**No’’ Count .... 79 -59 -.83 -.85 -.55 -.76 -.04 
Epileptoid ............. -.22 -.07 .06 15 .05 .05 
Paranoid ................. -.77 56 52 54 30 
Autistie 00000... — 42 02 49 .68 
Depressed. ............ — .75 21 87 
| RARER — .66 27 
Hysteroid ..W... —.53 
TABLE II 


Correlations (Zero Order) between Responses of Subjects, Classified by 
Components, to All Valid Questions? 








A a meter 
4 3 . 2 = Z 
COMPONENT = a = | a = 
ae ee eee 
Zz = a < < a 
Epileptoid ............. 044 177 242 056 112 406 
Paranoid .............. 551 531 497 168 387 
Autistic ....... 438 — .194 505 635 
Depressed ............. 535 032 887 
Manic ............ 694 187 
Hysteroid ............... 318 





FURTHER ANALYSIS 


The negative correlations found by us between the ‘‘Nor- 
mal’’ and the other six components of temperament as mea- 
sured by this scale suggested the possibility that the group we 
were studying might be atypical with respect to the number 
having low profile scores on the ‘‘ Normal’’ component. 

In order to test this hypothesis the cards on which the scores 
were recorded were run through the ‘‘sorter’’ of the Hollerith 

7 American Journal of Psychiatry, Vol. 92, No. 1, July, 1935, p. 185. 
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machine, to determine for each component how many subjects 
in the group had a ‘‘plus 3’’ profile score,* how many a ‘‘ plus 
2,’’ ete., through 0 which is considered ‘‘borderline’’ to 
‘*minus 3.”’ 

The results of this analysis are presented in Table III. 
They show very clearly that in our group only 151 individuals 
or 35 per cent might be classified as having a strong ‘‘ Normal’”’ 
component while 321 or 73 per cent have strong manic charac- 
teristics and 253 or 58 per cent show depressed symptoms. 
There were more subjects ranking as ‘‘strong’’ in Paranoid 
and Autistic components than in the Normal. 

The differences between the authors’ findings and ours with 
respect to the degree of relationship between the Paranoid 
component and the Epileptoid and Depressed components, re- 
spectively, may be a function of the difference between the two 
groups of subjects. The authors’ group included a number of 
patients who had been committed to institutions because of 
epilepsy. Our group, on the other hand, included a large per- 
centage of unemployed. It is quite possible that the constitu- 
tional inferiority of the epileptics gave rise to feelings which 
are characteristic of paranoia and that not being able to get a 
job made many members of our group depressed to an abnor- 
mal degree and also generated a sense of their being discrimi- 
nated against. 


CONCLUSIONS 


1. When we take into consideration the composition of the 
group studied we may conclude that our findings are in agree- 
ment with the theory of the authors as to the importance of 
the relative size of the ‘‘Normal’’ component in determining 
adjustment. They say on page 6 of the Manual of Directions, 
‘*Very impressive among our findings was the matter of inter- 
relationships. The same high quantitative showing (with the 
exception of the ‘Normal’ component) was made by a highly 


8 The profile values are derived from the raw score norms and are pub- 
lished in the Manual of Directions for scoring the test. 
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successful salesman as by a psychiatric subject suffering from 
a manic-depressive psychosis . . . its significance in terms of 
behavior differed by reason of the strength of the ‘Normal’ 
component which was high in the salesman and low in the psy- 
chiatric subject.’’ 

Table III reveals that many persons who exhibited strong 
tendencies toward behavior which in its extreme, uncontrolled 
form, is diagnosed as psychotic, did not reveal a correspond- 
ingly high degree of the ‘‘Normal.’’ This may explain why 
the members of the group as a whole find themselves in situa- 
tions in which they need the kinds of assistance offered by the 
Guidance Service. 

A further indication that this may be the case is furnished — 
by a comparison of Tables I and II, which reveals a consis- 
tently different relationship between the ‘‘ Normal’’ component 
and each of the others for the two groups studied. For our 
group the coefficients are all negative, for the other group all 
positive. 

2. The correlation between the ‘‘Normal’’ component and 
the ‘‘no’’ count being positive and those between the other 
components and the ‘‘no’’ count being negative, are in accord 
with the statement made by the authors of the test, that ‘‘the 
‘Normal’ component is primarily a control mechanism... 
essentially a ‘brake’ or a ‘balance wheel’’’ and ‘‘presents 
mainly characteristics associated with restraint.’’ 

3. Our intercorrelations for the components of tempera- 
ment, with the exception of the ‘‘Normal,’’ are not in accord 
with the theory that such identifiable syndromes exist : 

a. The correlation between the Paranoid and Hysteroid 
components suggests overlapping of these two. 

b. The Manic and Depressed components are so closely 
related as to suggest that as here measured they are one and 
the same. This high correlation may be due to the way that 
the weights for each question were derived. If the answers 
of subjects who were suffering from a manic-depressive psy- 
chosis were weighted for the Manic component without de- 
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termining whether the subjects were in the manic phase or 
the depressed we should expect to find that the final weights 
were a measure of the factors common to both rather than a 
means of distinguishing between the two patterns. 

e. The correlations of the Autistic and Paranoid compo- 
nents with the others indicate that they do not exist as dis- 
tinct factors in temperament or that the test, in its present 
stage of development, does not include sufficiently discrimi- 
nating measures of these factors. 

d. The above analysis of the differences between the find- 
ings for the two groups with respect to the size of the corre- 
lations between the Paranoid and the Epileptoid and be- 
tween the Paranoid and the Depressed components, sug- 
gests that Paranoid reactions are the result of circumstances 
rather than being indicative of a distinct type of tempera- 
ment. 

4. There is no doubt that this test offers possibilities for 
further research, among them: 

a. The determination of the reliability of the scale by the 
test-retest method. A scale which purports to measure tem- 
perament should stand up under the retest method. If it 
does not we must conclude that what it measures is some- 
thing far more transitory, a ‘‘mood’’ for instance, rather 
than the fundamental personality pattern or organization 
which gives rise to such. 

b. The determination of the validity of the scale by check- 
ing the replies of large numbers of persons, representative 
of a cross section of the community, against some objective 
criterion like a well devised objective rating scale which 
might include behavior-grams for the guidance of the rater. 
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A SHORT SCORING METHOD FOR THE LINK 
P.Q. TEST 


CHARLES C. GIBBONS 
Ohio State University 


HIS paper describes an economical method of scoring 
g% Link’s P.Q. or Personality Quotient test. The pub- 
lisher provides, with each test, a score sheet on which 
the individual’s responses are entered and his score computed. 
The test manual states that hand scoring for Trait 1 alone 
takes about ten minutes; for all five traits, about twenty min- 
utes. This scoring time has restricted the use of the Link test. 
An attempt was made to find a time-saving method of scor- 
ing the Link test by the use of scoring strips and Veeder coun- 
ters. The method by which these scoring strips were made is 
described below. By this method a test can be scored for Trait 
1 in three minutes; for all five traits in eight minutes. It 
would be much less expensive for the publishers to print these 
scoring strips instead of the score sheets which they now pro- 
vide.. For the present, however, those who wish to use this 
method will need to make their own scoring strips. Two per- 
sons, working together, can make a complete set of scoring 
strips in about four hours. 


HOW TO PREPARE SCORING STRIPS 


To prepare the scoring strips, one needs a copy of the test 
and score sheet for boys, a copy of the test and score sheet for 
girls, and twenty-two strips of white cardboard, size 2} x11 
inches. It is more economical for two persons to work together 
in preparing the scoring strips. One person reads the score 
values from the score sheet which accompanies the test. The 
other person places a cardboard strip on a test and writes the 
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score values, as they are read, opposite the corresponding 
items. 

Except for the first six questions, all items are scored either 
0,1, or 2. The score sheet provided with the test indicates the 
amount of credit to be given for a ‘‘ Yes,’’ ‘‘No,’’ or ‘‘ ?’’ re- 
sponse. Making scoring strips consists of writing these score 
values on strips so that when the strip is laid on any test the 
score values will be in alignment with the corresponding test 
items. Figure 1 is a portion of a scoring strip for boys’ Trait 
1. The numbers in parentheses are the numbers of the items. 
The two columns on the key correspond to the two columns on 
each page of the test. No score values are entered for items 
126, 145, 147, and 149, because the score sheet shows that these 
items are not scored for Trait 1. In this manner, strips are 
made for each of the four pages of the test. 


WORN 











2,0,1 (/23) (144) 2,0,/ 
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Fie. 1. Portion of scoring strip for boys’ Trait 1. 


To prepare scoring strips for the other four traits which the 
test attempts to measure, proceed in the same way except that 
the values for Traits 2 and 3 (Social Initiative and Self-Deter- 
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mination) are put on the same strips. Write Trait 2 values in 
red ink and Trait 3 values in blue ink. For the few items that 
are scored for both traits, write the values in red and encircle 
them with blue. In the same way, prepare a set of strips con- 
taining values for both Trait 4 and Trait 5 (Economic Self- 
Determination and Sex Adjustment). Write Trait 4 values 
in red ink and Trait 5 values in blue ink. 

The scoring strips for girls’ tests are prepared from a girls’ 
test and answer sheet by the same method. 


HOW TO USE THE STRIPS IN SCORING 


For one who wishes to score a large number of Link tests, 
the following steps are suggested : 

1. Separate the boys’ tests from the girls’ tests. 

2. Secure two Veeder counters which have been mounted on 
a board and the four scoring strips for boys’ Trait 1. For 
items 1-6, inclusive, compute the amount of credit by the rules 
stated on the score sheet (These rules may be abbreviated on a 
scoring strip). Count these values on the Veeder counter 
mounted on the right side of the board. 

3. For the rest of the items, count on the right-hand counter 
the values on the scoring strip that correspond to the person’s 
responses. The values 2, 0, 1 (see Fig. 1) mean 2 points credit 
for a ‘‘Yes’’ response, 0 points for a ‘‘No’’ response, and 1 
point credit for ‘‘?.’’ Similarly, 0, 2, 1 are the amounts of 
eredit for ‘‘ Yes,’’ ‘‘No,’’ and ‘‘ ?,’’ respectively. 

4. After all the papers have been scored for Trait 1, secure 
a set of strips for Traits 2 and 3. The method of counting 
values is the same as for Trait 1 except that values in red (Trait 
2) are counted on the right-hand Veeder counter and values 
in blue (Trait 3) are counted on the left-hand Veeder counter. 

5. Secure a set of strips for Traits 4 and 5. Proceed as 
before, counting values in red (Trait 4) on the right-hand 
counter and values in blue (Trait 5) on the left-hand counter. 

Experience has shown that this scoring method: (1) Re- 
duces scoring time for Trait 1 from 10 minutes to 3 minutes. 
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(2) Reduces scoring time for all five traits from 20 minutes to 
8 minutes. (3) Eliminates the expense of having a score sheet 
for each person. (4) Decreases the likelihood of errors in 
scoring. (5) Reduces monotony and eye-strain on the part of 
the scorer. 

















NOTES 


CRITICAL NOTE ON THE PRESSEY INTEREST- 
ATTITUDES TEST 


ROBERT L. THORNDIKE 
Columbia University 


In 1933 Pressey and Pressey! reported a new personality test, which was 
an outgrowth of the earlier X—O test. This they called an Interest-Atti- 
tude test. It consists of four sub-tests of 90 items each. In the first test, 
the subject is instructed to check once everything that he thinks is wrong 
or that a person should be blamed for, and to check twice everything that 
he thinks is very wrong. The second test related to worries, the third to 
interests, and the fourth to traits admired in people. 

The scoring of this test is in terms of a maturity score. Items are 
included in the test which show a progressive change in the frequency of 
checks as one-goes from sixth-grade groups to junior and senior high 
school groups and on to college groups. Scores are to be translated into 
grade equivalents, and indirectly into age equivalents. 

Certain items of the test show a greater frequency of checks in the 
higher age and grade group. In most cases, however, the frequency of 
checking decreases as one goes up the age scale. This is the case in 80 of 
the 90 items in the first two tests, 60 of the 90 in the third, and 40 of the 
90 in the fourth, or in all 260 out of 360. This makes total score on the 
test, and especially score on the first two sub-tests, dangerously dependent 
upon what we may call the ‘‘checking threshold.’’ That is, a person may 
get a low maturity score (corresponding to checking many immature and 
few mature items) merely because he tended to check freely both mature 
and immature items and the test gave more than twice as much opportunity 
to check the latter. 

It is possible to demonstrate the influence of such a ‘‘checking 
threshold’’ by studying the responses on sub-tests 3 and 4, where the 
number of mature and immature items is more nearly equal. The test 
papers of a group of 25 gifted boys and a group of 24 gifted girls ap- 
proximately 10-12 years of age were analyzed to bring out this factor. 
First, the total number of checks on mature items was correlated with the 


1 Sidney L. Pressey and Luella C. Pressey, ‘‘ Development of the Inter- 
est-Attitude Tests,’’ Journal of Applied Psychology, 1933, 17, 1-16. 
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number of checks on immature items. The correlation was .83 for boys 
and .90 for girls. In other words, there is a very marked positive relation- 
ship between the number of mature items checked and the number of im- 
mature items checked. If the test is measuring primarily maturity, we 
should expect the individual who marks many mature responses to mark 
few immature ones, and vice versa. This was not the case. 

Then a correlation was obtained between score (immature responses 
minus mature responses) on one half of the items and the other half of 
the items. This was found to be .64 for boys and .75 for girls. In this 
case, the influence of the ‘‘ checking threshold’’ is largely eliminated, as a 
result of the subtraction. We find that when this effect is eliminated there 
is still a fairly consistent differentiation on the basis of maturity. 

Finally, a correlation was computed between total number of checks on 
one half of the mature and the immature items and total number of checks 
on the other half. The result was .87 for boys and .93 for girls. In our 
first pair of correlations the ‘‘checking threshold’’ and the maturity in- 
fluence were working in opposition. In the correlations just reported, the 
influence of maturity is removed, and we get a pure indication of the 
consistency of the ‘‘ checking threshold.’’ 

These results show clearly that for the group here studied an individual 
was characterized much more clearly and reliably by his ‘‘ checking 
threshold’’ than he was by his maturity score. When the two forces were 
in opposition, as in our first correlations, the ‘‘ checking threshold’’ was 
clearly predominant, and when the influences were separated, as they 
were in our second and third sets of correlations, the ‘‘ checking threshold’’ 
provided the more reliable score. 

For sub-tests 3 and 4, where the number of mature and immature items 
is about equal, the influence of the ‘‘ checking threshold’’ is removed by 
the technique of scoring. In sub-tests 1 and 2, however, and in the total 
score for an individual, it may well be the predominant factor. Of course 
it is possible that mere willingness to check items may be a satisfactory 
index of immaturity. It seems, however, that we should view the score 
of an individual with scepticism when we find evidence that it may depend 
primarily on such a trivial and ill-defined tendency. 











A NOTE ON THE CRITICAL NOTE 


SIDNEY L. PRESSEY 
Ohio State University 


Dr. Robert Thorndike has kindly sent the writer a carbon of the note 
reporting the ‘‘checking threshold’’ as a major and largely irrelevant 
factor in scores of 48 children 10-12 years of age on the ‘‘ Interest-Atti- 
tude’’ Tests. He much appreciates this opportunity to make early com- 
ment. 

The possibility that mere tendency to check might be a dominant ele- 
ment in total response on such a test was so much a matter of concern 
to the writer when, nearly twenty years ago, he developed the X-O Tests 
that he put in every line of two of the tests a ‘‘joker’’ or word which 
would be marked only if the subject checked over-extensively and indis- 
eriminately.1 Marking of these jokers would signal the scorer when this 
factor appeared, and number of jokers marked might even serve as a 
mvasure of this tendency. But nobody seemed to make use of the jokers 
except the writer. And to his surprise, he found very few instances of 
such tendency to indiscriminate marking. In the X-O Form B the joker 
idea was not kept because of the above reasons and because emphasis was 
on the individual items and ‘‘net differential scores’’ where the factor 
should cancel itself out. 

When work was started on the Interest-Attitude tests the matter was 
again given consideration. The original intention was to have in each test 
about an equal number of ‘‘mature’’ and ‘‘immature’’ items, so that the 
mere tendency to check would be cancelled out as a factor in total score. 
But extensive trial lists failed to yield sufficient wrongs and worries which 
were marked more often with increasing age because of increasing emo- 
tional maturity rather than increasing information or intelligence.2 The 
empirical fact seemed to be that one important characteristic of the devel- 
opment of personality from ten to eighteen was a vigorous discarding of 
childhood ideas of wrong and childish fears. 

1 Pressey and Chambers, ‘‘ First Revision of a Group Scale Designed for 
Investigating the Emotions,’’ Journal of Applied Psychology, 1920, 4, 101. 

2 The point is briefly touched upon in the article describing the develop- 
ment of the tests (Journal of Applied Psychology, 17: 1-16) as is also the 
possibility of spurious effects due to the ‘‘ tendency of some children to use 
a good many checks generally while some children tend to mark less.’’ 
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To have dropped the wrongs and worries tests because of this lack of 
balance between mature and immature items would then have meant the 
omission of something very significant. A different test technique might 
have been used. But with this single-word question technique students 
were indicating three levels of response to 360 items (a total possible range 
of score of 720 points) in 30 minutes all on a small four-page folder, and 
scoring was easy (and fitted a plan for machine scoring which the author 
then had in mind). Convenience is worth while. A rich yield of score 
within a convenient time is needed for adequate measurement. Other tech- 
niques involved other possible irrelevant tendencies (most tests in the field 
involve some such). A change in technique at this juncture seemed likely 
to lose far more than it gained. The decision therefore was to build the 
tests on the data, and then see what results a trial form yielded. 

The trial showed that from grades 5 to 12 medians changed 140 points— 
the score appeared sensitive to something. Further (contrary to what one 
would expect if mere tendency to check were an important factor), over- 
lapping of grade groups was surprisingly slight for tests in this field; 
thus only 20 per cent of boys in the eighth grade scored below the median 
for the sixth. Comparisons of groups (as country high school versus 
sophisticated girls finishing school) showed results which would be ex- 
pected. Correlations with estimated emotional maturity were reasonably 
good. Comparisons of item analyses with similar items in other studies 
of play and interest showed parallel curves. 

It was therefore decided to issue the tests, with very complete data— 
norms for each sex and each test and for each one of the 360 items for a 
total of over 4000 cases. It was felt that the form should at least be 
valuable as a research questionnaire of exceptional range and richness of 
interpretive data (thus item comparisons between groups should not be 
grossly warped by such a general factor as the checking threshold). The 
scores did seem to have significance. And with both positive and negative 
items in each test (and in different proportions in different tests), and with 
such full data, an unusual variety of cross comparisons in research could 
be made, and possible faults readily detected. Presumably Dr. Thorn- 
dike’s data at the very least leave the test this last merit! 

The writer would argue that the merit is more important than might 
at first seem—and that Dr. Thorndike has touched upon a need far more 
important than he realizes. It is not after all an extraordinary omission 
in modern test technique that methods have not been developed by which 
the ‘‘internal consistency’’ of an individual’s performance on a test may 
be readily determined! And is it not a merit of the Interest-Attitude 


8 Almost twenty years ago the writer argued that on the Binet irregular- 


ity might be used in something this way (‘‘Irregularity on a Binet exami- 
nation as a measure of its reliability,’’ Psychological Clinw, 12: 236-40, 
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tests that they readily permit such handling? Since the first test contains 
only 10 negative items out of 90 and the last 50 out of 90, the difference 
between score on first and last tests should be evidence of the extent to 
which such factors as the ‘‘checking threshold’’ have operated. The 
writer is planning shortly to go over results now on file, in such fashion, 
and make further study of this total issue. 

In the meantime he would contend (in summary) that the weight of 
evidence was against the conclusion that a ‘‘ trivial and ill defined tendency 
to check’’ was often a devastatingly important factor in scores on these 
tests (though it may be so in such special groups of children, at the bottom 
of the age range for which the tests are planned* )—that the tests had, in 
fact, certain distinctive merits. He would argue further that Dr. Thorn- 
dike himself has all but demonstrated that a ‘check upon checking’ is 
readily obtained for each pupil. And he would contend finally that failure 
to utilize the interrelations between sub-scores as well as their sum, and to 
eall for evidence of internal consistency as well as total in reporting an 
individual’s score, are omissions in present-day testing methods which 
might well receive attention. 





1919). But nobody (including the writer) made much use of the idea; 
and trial of the notion on group tests of intelligence yielded too incon- 
elusive results (for reasons which need not be gone into here) to publish. 

* However, in later correspondence Dr. Thorndike has kindly sent the 


scores of these children—and they score as more mature than their 
chronological age about as the writer would expect they should. 














NOTE ON ‘“‘1.Q. CHANGES IN FOSTER HOME 
CHILDREN’’ BY EMMETT L. SCHOTT 


ROBERT L. THORNDIKE 
Columbia University 


In his report in this journal,1 Schott states that the median gain of 5.8 
points for 74 children retested after placement in foster homes is not 
statistically significant. He says ‘‘the ratio of the actual difference to 
the standard deviation of that difference is slightly less than .28.’’ 

A casual examination of the data suggested that this statement was 
probably incorrect, so the computations were checked using Schott’s Table 
III. From this table it was found that the mean test-retest difference 
was +3.95 points of 1.Q., the standard deviation of the distribution of 
differences was 7.3, and the standard error of the mean difference was 0.85. 
The difference was about 4.6 times its standard error. Obviously, it was 
statistically significant. 

The conclusion of Schott’s article should read: ‘‘After an average 
length of residence in a foster home of about one year, our group show a 
statistically reliable gain in I[.Q.’’ 

1 Emmett L. Schott, ‘‘1.Q. Changes in Foster Home Children,’’ Journal 
of Applied Psychology, 1937, 21, 107-112. 











NEWS AND NOTES 


Within the past year a number of psychologists have conducted studies 
on problems pertaining to the radio under the stimulus and guidance of 
the Princeton Radio Research Project of the School of Public and Inter- 
national Affairs of Princeton University. These studies comprise a part 
of the research program in this field made possible by a grant from the 
Rockefeller Foundation and directed by Dr. Paul Felix Lazarsfeld in 
association with Dr. Hadley Cantril of Princeton, and Dr. Frank N. 
Stanton of the Columbia Broadcasting System. 

Owing to the interest and importance of these studies the editors of 
the JOURNAL oF APPLIED PSYCHOLOGY are pleased to announce that its 
next issue, February, 1939, will be a special number devoted primarily to 
publication of these studies with Dr. Lazarsfeld as guest editor. The 
contents will include reports on the influence of the radio as a factor in 
different kinds of activities as well as research on fundamental index 
problems and problems of questionnaire handling. There will also be a 
review of the literature on listening habits and notes of current research 
in this general field being conducted at various institutions. The guest 
editor, Dr. Paul Felix Lazarsfeld, would appreciate communications from 
psychologiste who have conducted, or are currently engaged in, any 
researches bearing on radio problems. He may be addressed at the Office 
of Radio Research, Field Headquarters, 22 East 17th Street, New York 
City, 


The National Research Council announces the publication of a new 
quarterly entitled Psychosomatic Medicine with the sponsorship of the 
Committee on Problems of Neurotic Behavior, Division of Anthropology 
and Psychology. The inauguration of this publication has been made 
possible by a grant from the Josiah Macy, Jr., Foundation. The aim of 
Psychosomatic Medicine is to encourage and bring together studies which 
make a contribution to the understanding of the organism as a whole, in 
somatic and psychic aspects. These materials are now usually separated 
widely in manner and place of publication because of differences in con- 
cept, approach and methods. The board of editors include the follow- 
ing: Managing Editor, Flanders Dunbar; Editors, Franz Alexander, 
(Psychoanalysis) ; Dana W. Atchley (Internal Medicine) ; Stanley Cobb 
(Neurology); Hallowell Davis (Physiology); Flanders Dunbar (Psy- 
chiatry) ; Clark L. Hull (Psychology) ; Howard 8S. Liddell (Comparative 
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Physiology); and Grover F. Powers (Pediatrics). All correspondence 
relative to subscriptions or to the publication of articles should be 
addressed to Dr. Flanders Dunbar, Managing Editor, Academy of Medi- 
cine Building, Room 445, 2 East 103rd Street, New York City. 


The University Tutorial Press, of London, England, now has ready a 
revised edition of ‘‘A Manual of Psychology’’ by G. F. Stout prepared 
in collaboration with C. A. Mace, of the University of London. Many 
chapters have been radically revised or largely rewritten, and a new 
chapter on ‘‘ Practical Conative Activity’’ is introduced. An Appendix 
concerning Gestalt Psychology by Dr. R. H. Thouless has been included 
which should greatly increase the usefulness of the book. 


The story of the production of copper is revealed in four new educa- 
tional motion picture films made under the supervision of the Bureau of 
Mines, U. 8. Department of the Interior, in cooperation with one of the 
larger mining companies. The titles of these films are ‘‘Copper Mining 
in Arizona,’’ ‘‘Copper Leaching and Concentration,’’ ‘‘Copper Smelt- 
ing’’ and ‘‘Copper Refining,’’ each showing the various steps in this 
interesting industry. Copies of these films, 16-mm. and 35-mm. size, for 
exhibition by those interested may be obtained from the Bureau of 
Mines Experiment Station, 4800 Forbes Street, Pittsburgh, Pa. No 
charge is made for the use of the films; the exhibitor pays only trans- 
portation charges. The width of film desired should be stated. 


The Institute of Human Relations of Yale University, together with 
the American Association of Motor Vehicle Administrators and the Esso 
Safety Foundation from the Standard Oil Company of New Jersey, is 
announcing an automobile driver research project in an effort to study 
the human element in motor vehicle accidents and to establish uniform 
standards for the examination of motor vehicle operators. The project 
will be carried out under the immediate direction of Dr. Harry R. De- 
Silva, nationally recognized authority on highway safety and automobile 
drivers. The driver research project will be carried on primarily to 
obtain much needed material on which to base examinations of motor 
vehicle operators. It will also carry on studies into the relation between 
the ages of driversa accidents. Efforts will be made to determine the 
most efficient methods for reeducating careless drivers and to obtain new 
and scientifically accurate data on eye defects among motorists. It is 
hoped to unearth through this research many pertinent facts which will 
aid in the framing of new and more effective motor vehicle legislation 
and regulation. ‘‘Unfortunately,’’ says Dr. DeSilva, ‘‘ many state motor 
vehicle laws have been passed without the benefit of scientific facts ob- 
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tained from equally scientific research. This study will open up a whole 
new field. It will result in the setting up of minimum standards for the 
examination of applicants for licenses to drive motor vehicles and will go 
a long way toward the nation-wide standardization of the laws governing 
driver examinations in the various states.’’ 


The following new publications issued by the Office of Education, 
United States Department of Interior, may be obtained at nominal cost 
from the Superintendent of Documents, Government Printing Office, 
Washington, D. D.: 

**Higher Education, 1930-36,’’ ‘‘Statistics of City School Systems, 
1935-36,’’ ‘‘College Student Mortality,’’ which tells how many students 
stop going to college, and why; ‘‘ Education and the Civil Service in New 
York City,’’ ‘‘Publie Affairs Pamphlets—An Index to Inexpensive 
Pamphlets on Social, Economic, Political and International Affairs.’’ 
How colleges and universities face or fail to face the problems of stu- 
dent accidents, illnesses, and fatigue is reported in a pamphlet, ‘‘Sani- 
tation in Institutions of Higher Education.’’ In ‘‘ Nature and Use of 
the Cumulative Record’’ the author, Dr. David Segel, Specialist in Tests 
and Measurements in the Office of Education says, ‘‘Greatest aid to a 
pupil can be given only when his rate and trend of development in vari- 
ous intellectual, social and physical traits are known.’’ 
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Henry C. Linx. The Rediscovery of Man. New York: Macmillan Com- 
pany, 1938. 257 p. $1.75. 

**Nero fiddled while Rome burned. Our popular leaders sing sweet 
songs while the character of a nation crumbles.’’ These lines from the 
first chapter indicate not only the theme but also the scope of Dr. Link’s 
book. 

His thesis is that American personality and character are crumbling 
under the rising tide of paternalism, both in the home and in the govern- 
ment. The paternalism itself arises from three hundred years of concern 
with physical sciences and the consequent neglect of the capacities of man. 
Mankind’s position and importance in the scheme of things has been im- 
pugned to such an extent that the individual is now prone to regard 
himself as a helpless clod tossed about and victimized by an environment 
over which he has no control. Governmental paternalism seems a rational 
solution of his difficulties to such an individual. 

The Rediscovery of Man is aimed at the refutation of the view of the 
individual as irresponsible—a view which the author conceives to be 
mechanistic. Basing his approach on personality studies, Link takes into 
consideration habits essential to adequate, self-reliant personality de- 
velopment and the manner in which they may be acquired. The author 
has but little patience with the personality tinkering being done in fields 
outside psychology, pointing out that personality development and the 
modification of habits, desirable or undesirable, are strictly psychological 
problems. Link regards, and with justification, that non-psychological 
tinkering with personality is but another expression of the individual’s 
wish to depend upon external agencies rather than take the responsibility 
for correcting his own traits. Communism, fascism, and our own bene- 
volent paternalism with its Social Security fit into the same picture, he 
asserts. 

The Rediscovery of Man is not a scientific book, as the term is properly 
used. It is pragmatic. Link is interested in getting action from his 
readers in the direction of assuming greater responsibility for their own 
welfare and personality characteristics, and in correcting in the future 
some of the present abnormalities. Probably few will agree with all the 
author says; most will be delighted that he has said it. Those who have 
a physical conception of organisms will be puzzled by Link’s use of the 
term ‘‘mechanism’’ until they are well into the book where they will 
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discover that he is quite as ‘‘mechanistic’’ in his psychology as are most 
of us. 

The book is written in a vivid and popular style, which is to say, the 
author knows his field sufficiently well to write about it in simple and 
exact words. No one will misunderstand what the book offers, but he will 
be provoked to intensive thinking in defense of some of his own beliefs. 

This reviewer found the opening chapters a trifle slow and the central 
portion fascinating. The last part of the book, in which the author is 
applying his thesis to a wide range of current situations and beliefs and 
relating it to the Christian Ethic, seemed to lack Link’s customary 
organization. 

Psychologists have recently been crying for works that get into the 
home and the marketplace. The Rediscovery of Man gets there. Link 
has opened up a vital problem—so vital that one wonders if most psy- 
chologists are not also fiddling while Rome burns. The book may be read 
by all psychologists for both pleasure and profit. In the interest of humil- 
ity and for the good of their scientific souls, the book is to be recommended 
particularly to those who, like the reviewer, hold a physical conception of 
psychology. 

MATTHEW N. CHAPPELL, 
842 Madison Avenue, New York City. 


Strange, RutH. Behavior and Guidance of Students in College and Sec- 
ondary School. New York: Harpers, 1937. Pp.515. $4. Counseling 
Techniques in College and Secondary School. New York: Harpers, 
1937. Pp. 159. $2. 

These two volumes constitute the second and third of a series of five by 
the author on the general field of guidance in educational institutions. 
The emphasis is more on personal or personality guidance, although 
vocational aid is naturally bound up in the discussion. Volume II is 
chiefly concerned with adjustment. Strang opens with two lengthy chap- 
ters on the problems of the student, such as emotional maladjustment, 
disciplinary problems, vocational worries, and emancipation from the 
family. The next four chapters deal with measurement and interpreta- 
tion of scores: intelligence, academic achievement, personality, and at- 
titudes and interests. These contain comprehensive and critical sum. 
maries of these fields, worked out in such a way that a counselor is aided 
in the choice and utilization of the various tests and measures. Pitfalls 
are indicated throughout. Finally we have discussion of the social and 
economic background of students, the use of time by high school and col- 
lege students, and financial problems of college students. 

This volume must be considered a compendium more than anything else. 
Dr. Strang has worked in 1310 references, often as many as a dozen on a 
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single page, yet has skillfully blended in quotations and citations to make 
the manuscript smooth and readable, as well as informative. One’s pro- 
jected use of the book determines its valuation. To the well trained 
psychologist or counselor, much of the material will seem superfluous, as 
it duplicates topics taken up in general psychology, or courses in tests 
and measurements. But to the individual who may be asked to do some 
counseling on the side, in addition to his regular teaching duties, the 
book should be remarkably handy in giving much valuable data, critical 
outlooks, and a well oriented point of view. 

Volume III aims to give more practical techniques of counseling. The 
principles suggested are equally applicable to college or high school levels. 
In the reviewer’s opinion the outstanding feature of this shorter volume is 
the discussion of making the case study, handling the interview, making 
observations, and using rating scales. The various hidden angles, pitfalls, 
and traps into which the unwary counselor might fall are pointed out in 
very astute fashion. The necessity of keeping permanent records for 
cumulative purposes and to obviate probable loss of memory for details 
is well pointed out. The author has also pointed out the many variables 
in the background, as well as the present environment of the student 
which are necessary to consider in arriving at a comprehensive diagnosis 
and constructive solution of the problem, personal, educational, or 
vocational. 

There is one limitation of this work, as well as of virtually every book 
on counseling, whether personal or vocational. This is the absence of a 
suggested constructive program. It is admittedly difficult to outline a 
sequence for someone else to follow; counseling must remain an individual 
matter, adapted to each individual, and hence it cannot be completely 
standardized. Yet, considering that Strang has given much elementary 
material which an experienced psychologist or trained counselor already 
knows, it would appear logical that the less experienced counselor would 
be very eager to find an outlined or skeleton organization of the conduct 
of a case from beginning to end, with approaches and possivle modes 
of solution suggested. Possibly a few case studies, reproduced more or 
less verbatim from actual interviews, with objective data which are to be 
considered interspersed, would give constructive advice of value. 

R. W. Hussanp, 
University of Wisconsin. 


LELAND W. Crarts, THEODORE C. SCHNEIRLA, Exsa E. Ropinson, RatPu 
W. Gricurist. Recent Experiments in Psychology. New York: 
McGraw-Hill Book Company, 1938. Pp. xiv +417. 

The diversity of what may legitimately be called psychological experi- 
mentation presents a serious problem in the choice of representative 
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studies. The authors of this volume have selected twenty-eight topics, 
devoting a chapter to each. These topics range from semi-popular studies 
of a ‘‘lightning-calculator’’ and of vocational analysis from photographs 
to the electro-physiology of the nervous system and the duplicity theory of 
vision. 

In most chapters a brief introduction is followed by condensed reports 
of one or several studies. Purpose, method, results and conclusions are 
given in some detail for a majority of the experiments. Consideration 
of the statistical significance of results is avoided. 

Recent Experiments in Psychology can perhaps be used with profit as 
supplementary reading for a class in experimental psychology. The con- 
densation of journal material makes it too difficult for the student in an 
introductory course while the failure to consider the majority of evidence 
on any topic reduces its value for those capable of comprehending the 
content. In the opinion of the reviewer, the presentation of experimental 
method to the members of elementary courses in psychology demands em- 
phasis upon the techniques of sound research rather than examples of 
investigation. 

Grorce K. BENNETT, 
The Psychological Corporation. 


Lovis P. THorpPe. Psychological Foundations of Personality. New 
York: McGraw-Hill Book Company, 1938. 

It is a little difficult to decide exactly what audience was expected to 
receive this book. From the preface, one would infer that Dr. Thorpe 
was writing for undergraduate college students, particularly in the field 
of education, who were going to become school teachers. The preface 
states that, ‘‘This volume is offered as a text book in courses wherein 
the nature of that most basic of all psychological concepts, the human 
personality, is given primary consideration. It is also offered as a 
source of easily accessible information to that host of earnest educators 
and professional men . . . who are anxious to secure a background in 
this field.’’ 

After reading this preface, I expected to find a book which dealt in 
fairly simple terms with the general problems of personality, giving a 
considerable amount of practical advice as to personality development and 
training. On the contrary, I found the book to be written in a very 
technical fashion with only one chapter, about one-tenth of the book, 
which could be called practical in its approach. 

Considered as a whole, the book suffers from excessive documentation. 
Dr. Thorpe reels up the impressive total of 1154 footnotes, which is prob- 
ably a record of some kind. Unfortunately, many of these footnotes deal 
with relatively unimportant studies and are likely to confuse rather than 
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aid the undergraduate student. The author’s manner of treating a given 
subject is to quote a considerable number of authors and investigations 
on each side of a controversial subject, finally giving his own conclusions. 
It was rather disappointing however, to notice that on some occasions his 
own conclusions could not be discovered and the controversial topic was 
left suspended in mid-air. 

The introductory material is heavily loaded in the direction of a biolog- 
ical viewpoint ; for example, some 30-odd pages are devoted to genes and 
the Mendelian studies of heredity—material which is probably rather 
technical for these students; but after having worked up this matter in 
such elaborate fashion, he is forced to say, ‘‘ We must all guard against 
the apparently strong tendency to regard traits as unitary blocks in per- 
sonality and as being potential or inevitable in gene inheritance.’’ (Page 
280.) If this is a tendency which must be guarded against, why should 
Dr. Thorpe go into such great detail about the mechanism of gene in- 
heritance? Another instance of hereditary bias occurs in his discussion . 
of maturation. Here he quotes those experiments which indicate that 
practice during the early years does not give a substantial advantage over 
maturation, but fails to quote any of the experimental data which show 
that training in certain types of activity may lead to substantial improve- 
ment which is not overcome by subsequent development. 

I found myself most dissatisfied, however, with the treatment accorded 
to such very fundamental problems as feelings, emotions, and motives. 
His treatment of the psychoanalytic school is exceedingly short, and some- 
what flippant. Freud, for example, gets a great deal less space in the 
book than MacDougall or Thorndike; although I feel reasonably certain 
that the science of psychoanalysis has contributed a great deal more to the 
understanding of personality than the educators who are frequently 
quoted. The treatment of Jung seems to me to be particularly unfair. 

The author’s point of view with regard to fundamental drives seems to 
be somewhat confused. At one time it seems that he has rejected the 
notion of motivational ‘‘energies’’ as having the same mystical nature as 
instincts. On page 275, however, we find this statement, ‘‘ There is much 
as well to commend the idea of draining off energy, sexual or otherwise, 
in the physically active realms of work, play, and athletics.’’ 

Dr. Thorpe’s position with regard to clinical psychology seems to be 
very critical. He makes the assertion, for example, that most clinical evi- 
dence is based too much on introspection and is lacking in experimental 
vertification. In his practical suggestions, however, we find that he relies 
not upon experimental, but upon clinical evidence. 

If we consider the various books with which this one might be com- 
pared, we find that by dissipating itself in trying to cover too many dif- 
ferent kinds of problems or points of view, it fails effectively to do any 





























BOOK REVIEWS 671 


of the various jobs which it might have done. From the point of view 
of objective and experimental psychology, it is somewhat inferior to 
Shaffer’s Psychology of Adjustment, and from the point of view of 
psychological theory, it cannot compare with Allport’s Personality—A 
Psychological Interpretation. It does not seem to me to have as many 
good practical suggestions, particularly for children, as Morgan’s Psy- 
chology of the Unadjusted School Child. This evaluation may suggest 
that the book has no value at all, an impression which I did not intend 
to give. It does contain a wealth of material, but is presented in such 
fashion that graduate students will probably find it difficult enough and 
the average undergraduate student will probably find it impossible. 

I should say that it would be useful as a handbook of psychological 
literature, particularly on the biological approach to many of these 
problems, because of its remarkable completeness of documentation. 

A poor job of proof reading makes it a bit exasperating to read. 

Ross STAGNER, 
University of Akron. 


F. F. Powers, T. R. McConne.i, W. C. Trow, B. V. Moore, anp C. E. 
SKINNER. Psychology in Everyday Living. New York: D. C. Heath 
and Co., 1938. Pp.x+51l. $2.75. 

To the sophisticated reader the title of this book suggests that as a 
textbook it deviates from the conventional text, such as Woodworth’s, 
Dashiell’s, or Warren and Carmichael’s, both in scope and in organiza- 
tion of subject matter. The book justifies its title. The authors have 
adhered to the express purpose of preparing 4 book ‘‘to meet the need 
of the college student for a well-integrated, modern text which will mate- 
rially contribute to his ability to manage his own life. . . .’’ 

A text that devotes more than one half of its space to personal and 
social adjustment and to ‘‘applications of psychology’’ naturally must 
do so at the expense of other subject matter. The most notable omissions 
are in the fields of the nervous system and of the sensory and perceptual 
processes. Approximately four pages are devoted to the nervous system 
in chapter 3, entitled Foundations of Behavior. In the same chapter 
slightly more than five pages are devoted to the receptor system. In 
chapter 14, Acquiring Knowledge and Skill, ‘‘sensing’’ merits nine pages 
and ‘‘perceiving’’ three. The latter type of organization may be de- 
fensible on the ground that sensing is essential to learning; yet it seems 
a bit illogical, if not misleading, to treat sense qualities in a chapter 
bearing this title. In addition to chapter 3 listed above, Part I, ‘‘The 
Nature of Psychology’’ (66 pages), contains two additional chapters, 
Psychology in Everyday Living, the introductory chapter, and Psychology 
as an Emerging Science. 
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Part II, ‘‘Understanding Ourselves’’ (164 pages), contains the fol- 
lowing chapter headings: Personality, Psychology of Adjustment, Human 
Motivation, Our Emotional Life: Its Genesis and Control, Intelligence 
and Special Aptitudes, and Individual Differences. The last two, as well 
as the chapter entitled Psychology of Adjustment, are especially good; 
their treatment is unusually cogent and, in the case of the last two, ably 
supported by experimental data. 

Part ITI, ‘‘ Adjustment to College and Life’’ (75 pages), consists of the 
following three chapters: Social Behavior and Adjustment, Choosing a 
Vocation, and Psychology and Marriage. The writer feels that this sec- 
tion of the book is, in comparison with the others, rather weak. There is, 
of course, an obvious limitation in suitable material in this field. The 
authors responsible for this section have elsewhere in the text produced 
some of the ablest chapters. 

Part IV, ‘‘ Learning and Study’’ (83 pages), is made up of five chap- 
ters as follows: The General Nature of Learning, Acquiring Knowledge . 
and Skill, The Acquisition of Attitudes, Learning to Study, and Problem- 
Solving and Creative Learning. 

Part V, ‘‘ Applications of Psychology’’ (91 pages), contains four 
chapters with the following titles: Psychology in Business and Industry, 
Psychology in Law, Politics, and Religion, Psychology in Medicine, and 
Psychology in Education. 

The book contains half-tones of twenty eminent psychologists. Ten of 
these appear on coated stock on two pages opposite the title page, and 
ten appear in similar manner following page 4. 

This book has the merit of being easily read. In this respect it excels 
most elementary texts. Neither the organization nor the style appears to 
have suffered from joint authorship, although the organization of the 
book is not one of its strong points. The reviewer feels that the authors 
have succeeded in a large measure in attaining their objectives. They 
have employed the available material to good advantage. That careful 
study of the book will be helpful to college students, most psychologists 
will probably agree. Perhaps many will agree that it will be more help- 
ful than a study of the conventional text. Whether or not the content is 
adequate for the introductory course in psychology is another matter. 
Certainly it compares unfavorably with the standard text in the field 
with respect to what we have come to regard as the solid subject matter 
of psychology. 

That the subject matter of the book under consideration has a useful 
place in the college curriculum the reviewer should agree. What its rela- 
tionship to the introductory course in psychology should be, teachers of 
psychology will have to answer. This book and others like it may set the 
pattern for future courses, or they may find a useful place as collateral 
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reading in the introductory courses or may be incorporated into the sub- 
ject matter of general orientation courses. We come back to the question 
of what psychology should be and who is to study it. Is it to the best 
interest of psychology to adapt it to popular demand? The student will 
probably gain little insight into psychological methodology or little appre- 
ciation of the scientific aspects of psychology from a study of a book of 
this kind. Moreover, one wonders about the advisability of devoting so 
little space to sensation and perception, the fields which have been de- 
veloped more nearly to scientific status than have any of the others. If 
the psychologist feels that the introductory course should give a simple 
exposition of the fact and theory of psychology with emphasis upon its 
scientific aspects, he will be nettled by such omissions and will be unre- 
ceptive to such a chapter as Psychology and Marriage. 
J. B. Stroup, 
State University of Iowa. 
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